1 Introduction

A hypergraph is a pair \((X,\mathcal {E})\) where X is a finite set and \(\mathcal {E}\) is a finite multiset of subsets of X. Elements of X are called vertices and elements of \(\mathcal {E}\) are called edges. If \(|E|=k\) for each \(E \in \mathcal {E}\), then the hypergraph is said to be k-uniform. A covering design is a hypergraph \((X,\mathcal {E})\) with the property that each pair of elements of X occur together in at least one edge in \(\mathcal {E}\). Sometimes the vertices of a covering design are referred to as points and its edges as blocks. A weighting of a set S is an assignment of nonnegative real values to the elements of S. For such a weighting w we abbreviate \(\sum _{s \in S'}w(s)\) to \(w(S')\) for each subset \(S'\) of S. We say such a weighting w is normalised if \(w(S)=1\).

In situations where every item in a data set must be compared with every other item in the set, it may be desirable to store the data across a number of machines in such a way that any two data items are stored together on at least one machine [\(L(D')=L(D)\) where \(D'\) is a covering design obtained from D by adding a new block that is a subset of an existing block. So L(m) is nonincreasing in m.

To illustrate, we discuss L(m) for \(m \in \{1,2,3,4\}\). In any covering design \((X,{\mathcal {B}})\) with \(|{\mathcal {B}}| \in \{1,2\}\), it can be seen that the entire point set X must be a block in \({\mathcal {B}}\). From this it follows that \(L(m)=1\) for \(m \in \{1,2\}\). Here are examples with three and four blocks.

Example 1.1

Let \(D=(X,{\mathcal {B}})\) be the covering design with point set \(X=\{1,2,3\}\) and block set \({\mathcal {B}}=\{\{1,2\},\{1,3\},\{2,3\}\}\). Let w be the normalised weighting of X given by \(w(x)=\frac{1}{3}\) for each \(x \in \{1,2,3\}\). Then \(w(B)=\frac{2}{3}\) for each \(B \in {\mathcal {B}}\) and hence \(L(D,w)=\frac{2}{3}\). It follows from Theorem 1.3 below that \(L(3) \geqslant \frac{2}{3}\), and hence we have \(L(3)=\frac{2}{3}\).

Example 1.2

Let \(D=(X,{\mathcal {B}})\) be the covering design with point set \(X=\{1,2,3,4\}\) and block set \({\mathcal {B}}=\{\{1,2\},\{1,3\},\{1,4\},\{2,3,4\}\}\). Let w be the normalised weighting of X given by \(w(1)=\frac{2}{5}\) and \(w(x)=\frac{1}{5}\) for each \(x \in \{2,3,4\}\). Then \(w(B)=\frac{3}{5}\) for each \(B \in {\mathcal {B}}\) and hence \(L(D,w)=\frac{3}{5}\). It follows from Theorem 1.5 below that \(L(4) \geqslant \frac{3}{5}\), and hence \(L(4)=\frac{3}{5}\). Since L(m) is nonincreasing in m, this implies that \(L(m) \leqslant \frac{3}{5}\) for each integer \(m \geqslant 4\).

A linear space is a covering design in which each pair of points occur together in exactly one block. For a positive integer s, a projective plane of order s is an \((s+1)\)-uniform linear space with \(s^2+s+1\) points and \(s^2+s+1\) blocks (note that we allow projective planes of order 1, even though they are sometimes excluded as trivial). For an integer \(s \geqslant 2\), an affine plane of order s is an s-uniform linear space with \(s^2\) points and \(s^2+s\) blocks. It is known that, for each order \(s \geqslant 2\), either both of these planes exist or neither do, and further that both exist if s is a power of a prime (see [15, §2.3], for example). An almost projective plane of order s is an \((s+1)\)-uniform covering design with \(s^2+s\) points and \(s^2+s\) blocks in which each point is in exactly \(s+1\) blocks. Almost projective planes of orders 2 and 3 exist but no example of order greater than 3 has been found (see [2, 3]).

We summarise some of the main results of [10] in the following theorem.

Theorem 1.3

[10] For each positive integer m,

(1)

Furthermore, we have equality in (1) if \(m=s^2+s+1\) and there exists a projective plane of order s or if \(m=s^2+s\) and there exists an affine plane of order s.

In this paper we will further the investigation initiated in [10] in a number of ways. In Section 2, we investigate values of L(D) subject to the requirement that D is a covering design of a certain specific type and establish data limits in the case of transversal designs and finite projective Hjelmslev planes. In Section 3, we observe some connections between the data limit and previously studied parameters, namely covering numbers and fractional matching numbers. In Section 4 we investigate the cases where \(m \in \{s^2+s,s^2+s+1\}\) for some positive integer s and investigate when equality in (1) is possible. We prove the following theorem and, in addition, we also characterise the covering designs that can achieve these bounds (see Lemmas 4.6 and 4.9 for details).

Theorem 1.4

  1. (a)

    For each positive integer s, we have \(L(s^2+s+1)=\frac{s+1}{s^2+s+1}\) if and only if a projective plane of order s exists. Furthermore, if D is a covering design with \(s^2+s+1\) blocks and at most \(s^2+s+1\) points such that \(L(D)=\frac{s+1}{s^2+s+1}\), then D is a projective plane of order s.

  2. (b)

    For each integer \(s \geqslant 2\), we have \(L(s^2+s)=\frac{1}{s}\) if and only if either an affine plane of order s exists or an almost projective plane of order s exists. If D is a covering design with \(s^2+s\) blocks and at most \(s^2\) points such that \(L(D)=\frac{1}{s}\), then D is an affine plane of order s.

In Sect. 5, we note that Theorem 1.3 establishes the asymptotics for the data limit and, further, we provide a stronger lower bound on the data limit.

Theorem 1.5

For each integer \(m \geqslant 2\), we have

(2)

where s is the positive integer such that \(s^2-s+1 < m \leqslant s^2+s+1\). Furthermore, if we have equality in (2), then \(m=4\) or \(m \in \{s^2+s,s^2+s+1\}\).

We will see in Lemma 5.2 that (2) agrees with (1) when \(m \in \{s^2+s,s^2+s+1\}\) and is strictly stronger for all other values of m. In Section 6 we give some concluding thoughts and discuss some open problems.

2 Data limits for certain classes of covering designs

In addition to finding bounds on L(m) (which we will do in later sections of this paper), we believe it is also of interest to find bounds on L(D) for various classes of covering designs. We investigate such bounds in this section.

Let \(H=(X,{\mathcal {E}})\) be a hypergraph. Throughout the paper, for each \(x \in X\), we denote \(\{E \in {\mathcal {E}}:x \in E\}\) by \({\mathcal {E}}_x\) and \(|\mathcal {E}_x|\) by \(r_x\). In hypergraph terms \(r_x\) is the degree of vertex x, while in design-theoretic terminology \(r_x\) is sometimes called the replication number of x. We say that H is regular if the value \(r_x\) is the same for all \(x \in X\). We say that H is intersecting if \(E_1 \cap E_2 \ne \emptyset \) for all \(E_1,E_2 \in {\mathcal {E}}\). For a covering design D, our first result gives a general method for finding a lower bound on L(D) by choosing a certain weighting of the blocks of D.

Lemma 2.1

Let \(D=(X,{\mathcal {B}})\) be a covering design and h be a weighting of \({\mathcal {B}}\) such that \(\sum _{B \in {\mathcal {B}}_x}h(B) \geqslant 1\) for each \(x \in X\). Then

$$\begin{aligned} L(D) \geqslant \frac{1}{\sum _{B \in {\mathcal {B}}}h(B)}. \end{aligned}$$

Proof

Let w be a weighting of X such that \(L(D,w)=L(D)\). We have

$$\begin{aligned} L(D)\sum _{B \in {\mathcal {B}}}h(B) \geqslant \sum _{B \in {\mathcal {B}}}\biggl (h(B)\sum _{x \in B}w(x)\biggr ) \geqslant \sum _{x \in X}w(x) = 1 \end{aligned}$$

where the first inequality follows because \(\sum _{x \in B}w(x) \leqslant L(D)\) for each \(B \in {\mathcal {B}}\) and the second follows because \(\sum _{B \in B_x}h(B) \geqslant 1\) for each \(x \in X\). \(\square \)

We observe the following useful corollary of Lemma 2.1.

Corollary 2.2

Let \(D=(X,{\mathcal {B}})\) be a covering design. If there is a submultiset \({\mathcal {B}}'\) of \({\mathcal {B}}\) and a positive integer t such that each point in X occurs in at least t blocks in \({\mathcal {B}}'\), then \(L(D) \geqslant t/|{\mathcal {B}}'|\).

Proof

Apply Lemma 2.1 with \(h(B)=\frac{1}{t}\) if \(B \in {\mathcal {B}}'\) and \(h(B)=0\) otherwise. \(\square \)

We can use Corollary 2.2 to exactly determine the data limit for coverings that are both uniform and regular and to give some general bounds relating to the replication numbers and block sizes of a covering design.

Lemma 2.3

If \(D=(X,{\mathcal {B}})\) is a covering design, then

$$\begin{aligned}\frac{\min \{r_x:x\in X\}}{|{\mathcal {B}}|} \leqslant L(D) \leqslant \frac{\max \{|B|:B \in {\mathcal {B}}\}}{|X|}.\end{aligned}$$

In particular, \(L(D)=\frac{r}{|X|}\) if \(r_x=r\) for all \(x \in X\) and \(|B|=|B'|\) for all \(B,B' \in {\mathcal {B}}\).

Proof

Let \(v=|X|\), \(m=|{\mathcal {B}}|\), \(r_{\min }=\min \{r_x:x\in X\}\) and \(k_{\max }=\max \{|B|:B \in {\mathcal {B}}\}\). Applying Corollary 2.2 with \({\mathcal {B}}'={\mathcal {B}}\) shows that \(L(m) \geqslant \frac{r_{\min }}{m}\). Furthermore, \(L(D,w) = \frac{k_{\max }}{v}\) where w is the normalised weighting of X given by \(w(x)=\frac{1}{v}\) for each \(x \in X\). This proves the first part of the theorem.

Now suppose that, for some integers r and k, \(r_x=r\) for all \(x \in X\) and \(|B|=k\) for all \(B \in {\mathcal {B}}\). Then \(\frac{r}{m} \leqslant L(D) \leqslant \frac{k}{v}\) from the first part and \(rv=km\) by counting point-block incidences. So \(L(D)=\frac{r}{m}\). \(\square \)

Note that Lemma 2.3 generalises results from [10] concerning balanced incomplete block designs. A balanced incomplete block design is a uniform linear space. Such designs necessarily have exactly r blocks containing each point for some fixed r and so the second part of Lemma 2.3 applies to them. We now establish another lower bound on L(D) based on the replication numbers of the points of a covering design D.

Lemma 2.4

If \(D = (X,{\mathcal {B}})\) is a covering design such that \(L(D)<1\) and \(\sigma = \sum _{x \in X} \frac{1}{r_x-1}\), then

Proof

Note that \(r_x \geqslant 2\) for each \(x \in X\) because otherwise \(X \in {\mathcal {B}}\) and \(L(D)=1\). For each \(B \in {\mathcal {B}}\), define

We want to apply Lemma 2.1. Therefore we need to show that \(\sum _{B \in {\mathcal {B}}_x}h(B) \geqslant 1\) for each \(x \in X\). Let \(x \in X\); then

where the inequality follows because there are \(r_x\) blocks containing x and, for every \(y \in X \setminus \{x\}\), there is at least one block containing x and y. So we do indeed have \(\sum _{B \in {\mathcal {B}}_x}h(B) \geqslant 1\).

The result now follows by Lemma 2.1 because

\(\square \)

We give a small example to illustrate the application of some of the lemmas we have just proven.

Example 2.5

Let \(D=(X,{\mathcal {B}})\) be the covering design with \(X=\{1,2,3,4,5\}\) and

$$\begin{aligned} {\mathcal {B}}=\bigl \{\{1,2\},\{1,3\},\{1,4,5\},\{2,3,4\},\{2,3,5\}\bigr \}. \end{aligned}$$
(3)

We have \(r_1=r_2=r_3=3\) and \(r_4=r_5=2\), and hence Lemma 2.3 gives \(L(D) \geqslant \frac{2}{5}\). Lemma 2.4 improves on this by giving \(L(D) \geqslant \frac{9}{17}\). This can be improved still further by applying Lemma 2.1 with block weights \(\frac{1}{5},\frac{1}{5},\frac{3}{5},\frac{2}{5},\frac{2}{5}\) respectively according to the order the blocks are listed in (3). This weighting makes the sum of the block weights on each point exactly 1, and hence Lemma 2.1 implies that \(L(D) \geqslant \frac{5}{9}\). In fact, we have \(L(D)=\frac{5}{9}\) because \(L(D,w)=\frac{5}{9}\) for the weighting of X given by \(w(1)=\frac{1}{3}\), \(w(2)=w(3)=\frac{2}{9}\) and \(w(4)=w(5)=\frac{1}{9}\).

As we just saw, Lemma 2.3 and Lemma 2.4 give bounds based on natural parameters of the covering design. Lemma 2.1 potentially yields stronger bounds, but this is reliant on finding a suitable choice of weighting for the blocks.

We noted that data limits for balanced incomplete blocks designs were considered in [10] and are covered by Lemma 2.3. We will conclude this section by considering data limits for two other classes of covering design: transversal designs and finite projective Hjelmslev planes. Both of these classes are closely related to the class of finite projective planes.

A (kn)-transversal design is a linear space \((X,{\mathcal {B}})\) such that \(|X| = kn\) and \({\mathcal {B}}\) consists of

  • k blocks of size n that partition X; and

  • \(n^2\) other blocks of size k.

Note that a projective plane of order n can be obtained from an \((n+1,n)\)-transversal design by adding a fixed new point to each of the blocks of size n. In general a (kn)-transversal design exists if and only if there are \(k-2\) mutually orthogonal latin squares of order n (see [15, §6.6], for example). The results we have proved so far in this section make it easy to determine the data limits of transversal deigns.

Theorem 2.6

If D is a (kn)-transversal design with \(k \leqslant n\), then \(L(D) = \frac{1}{k}\).

Proof

Note that D is a covering design with kn points and blocks of sizes k and n. Hence \(L(D) \leqslant \frac{n}{kn} = \frac{1}{k}\) by Lemma 2.3. For the lower bound, we apply Corollary 2.2 with \({\mathcal {B}}'\) taken to be a set of k blocks of size n in \({\mathcal {B}}\) that partition X. This shows that \(L(D) \geqslant \frac{1}{k}\). \(\square \)

A (tq)-projective Hjelmslev plane can be defined as an intersecting covering design \((X,{\mathcal {B}})\) such that there exists a projective plane \((P,{\mathcal {L}})\) of order q, a partition \(\{X_p:p \in P\}\) of X and a partition \(\{{\mathcal {B}}_L:L \in {\mathcal {L}}\}\) of \({\mathcal {B}}\) with the following three properties.

  • For all \(L \in {\mathcal {L}}\) and \(p \in P\), we have \(|L \cap X_p|=t\) if \(p \in L\) and \(|L \cap X_p|=0\) otherwise.

  • Any two points in X that occur together in more than one block are in the same class in \(\{X_p:p \in P\}\).

  • Any two blocks in \({\mathcal {B}}\) that intersect in more than one point are in the same class in \(\{{\mathcal {B}}_L:L \in {\mathcal {L}}\}\).

It is known that for any (tq)-projective Hjelmslev plane, we have \(|X|=|{\mathcal {B}}|=t^2(q^2+q+1)\), \(|B|=t(q+1)\) for each \(B \in {\mathcal {B}}\) and \(r_x=t(q+1)\) for each \(x \in X\) (see [7], for example). A (1, q)-projective Hjelmslev plane is simply a projective plane of order q. From Lemma 2.3, we immediately have the following.

Theorem 2.7

If D is a (tq)-projective Hjelmslev plane, then \(L(D) = \frac{q+1}{t(q^2+q+1)}\).

It is not fully known for which values of t and q there exists a (tq)-projective Hjelmslev plane, but it is known that a (qq)-projective Hjelmslev plane exists whenever a projective plane of order q exists [11, 12]. We briefly consider the example of a (2, 2)-projective Hjelmslev plane.

Example 2.8

From the immediately preceding discussion, there exists a (2, 2)-projective Hjelmslev plane. Such a plane is a covering design \(D=(X,{\mathcal {B}})\) such that \(|{\mathcal {B}}|=|X|=28\) and \(|B|=6\) for each \(B \in {\mathcal {B}}\). We have \(L(D) = \frac{3}{14}\) by Theorem 2.7. This establishes that \(L(28) \leqslant \frac{3}{14} \approx 0.2143\). Theorem 1.5 establishes \(L(28) \gtrapprox 0.2095\), improving on Theorem 1.3 which gives \(L(28) \geqslant \frac{1}{5}= 0.2\).

3 Relationship to other parameters

In this section we observe a relationship between the data limit and two other previously studied combinatorial parameters.

The dual of a hypergraph is the hypergraph whose vertex set is \({\mathcal {E}}\) and whose edge set is \(\{{\mathcal {E}}_x:x \in X\}\). Thus the dual is obtained by interchanging the role of vertices and edges. Observe that a hypergraph is intersecting if and only if its dual is a covering design. We say a hypergraph is simple if no two of its edges are incident with exactly the same set of vertices (note that if a hypergraph is not simple then its dual will contain multiple vertices incident with exactly the same set of edges). We say a weighting w is rational if it assigns only rational weights.

A fractional matching in a hypergraph \(H=(X,\mathcal {E})\) is a weighting w of \(\mathcal {E}\) such that \(\sum _{E \in \mathcal {E}_x}w(E) \leqslant 1\) for each \(x \in X\). The size of w is \(\sum _{E \in \mathcal {E}}w(E)\). The fractional matching number \(\nu ^*(H)\) of H is the maximum size of a fractional matching in H. Because \(\nu ^*(H)\) is the optimal value of a linear program, it is clearly well-defined for any hypergraph H. In [8], Füredi studied fractional matching numbers of intersecting hypergraphs. We first exhibit the relationship between these and data limits.

Lemma 3.1

  1. (a)

    For any covering design D, we have that \(L(D)=1/\nu ^*(H)\) where H is the dual of D. Furthermore, \(L(D)=L(D,w_0)\) for some rational normalised weighting \(w_0\) of the points of D.

  2. (b)

    For any positive integer m, \(L(m)=1/\nu ^*(m)\) where \(\nu ^*(m)\) is the supremum of \(\nu ^*(H)\) over all intersecting hypergraphs H with m vertices. Furthermore, \(L(D)=L(D,w_0)\) for some covering design D with m blocks and some rational normalised weighting \(w_0\) of the points of D.

Proof

We first prove (a). Say \(D=(X,{\mathcal {B}})\) and let \(H=({\mathcal {B}},\{{\mathcal {B}}_x : x \in X\})\) be the dual of D. Note that H is intersecting. Because \(\nu ^*(H)\) is the optimal value of a linear program with integer coefficients, there is a fractional matching of size \(\nu ^*(H)\) in H in which each edge is assigned a rational weight. This induces a rational weighting \(w^*\) of the point set of D such that \(w^*(B) \leqslant 1\) for each \(B \in {\mathcal {B}}\). Then, for the rational normalised weighting \(w_0\) of X given by \(w_0(x)=w^*(x)/\nu ^*(H)\) we have \(L(D,w_0) \leqslant 1/\nu ^*(H)\). Thus \(L(D) \leqslant 1/\nu ^*(H)\).

Now, consider a normalised weighting w of X such that \(w(B) \leqslant L(D)\) for each \(B \in {\mathcal {B}}\). There is a fractional matching \(w^*\) of H of size 1/L(D) defined by \(w^*({\mathcal {B}}_x)=w(x)/L(D)\). So \(\nu ^*(H)\geqslant 1/L(D)\) or, equivalently, \(L(D) \geqslant 1/\nu ^*(H)\). Thus, in view of the preceding paragraph, we have that \(L(D)=1/\nu ^*(H)\) for any covering design D, and that \(L(D)=L(D,w_0)\) for some rational normalised weighting w of the points of D. So we have proved (a).

The first part of (b) follows directly from (a) and the definition of L(m). There must be a hypergraph \(H_0\) with \(\nu ^*(H_0)=\nu ^*(m)\) because there are only finitely many nonisomorphic simple hypergraphs H with m vertices and it suffices to consider only these in the definition of \(\nu ^*(m)\). Applying (a) to the dual of \(H_0\) completes the proof of (b). \(\square \)

Note that it follows from Lemma 3.1 that the infimums in the definitions of L(D) and L(m) are in fact minimums. Using the fact that the fractional matching number of a hypergraph is the optimal value of a linear program, Füredi [8] observed the following.

Lemma 3.2

[8] Let \(H=(X,{\mathcal {E}})\) be an intersecting hypergraph. Then, there is a subset \(\mathcal {E}'\) of \(\mathcal {E}\) such that \(|\mathcal {E}'| \leqslant |X|\) and \(\nu ^*(H')=\nu ^*(H)\), where \(H'\) is the intersecting hypergraph \((X,\mathcal {E}')\).

This allows us to give an equivalent definition of the ATAC data limit which is sometimes easier to work with.

Lemma 3.3

For each positive integer m, L(m) is the minimum value of L(D) over all covering designs \(D=(X,{\mathcal {B}})\) with \(|X|=|{\mathcal {B}}|=m\).

Proof

Lemma 3.2 implies that the quantity \(\nu ^*(m)\) defined in Lemma 3.1(b) can be equivalently defined as the maximum value of \(\nu ^*(H)\) over all intersecting hypergraphs H with m vertices and at most m edges. Since edges may be given weight 0 in a fractional matching this is clearly also the maximum value of \(\nu ^*(H)\) over all intersecting hypergraphs H with m vertices and exactly m edges. The result now follows from Lemma 3.1(b). \(\square \)

We now discuss a relationship with another well-studied design theoretic parameter. For positive integers k and v with \(k \leqslant v\), the covering number C(vk, 2) is the minimum number of blocks in a k-uniform covering design with v points. In [14], Mills considers the quantity

for given values of m, showing that this maximum does indeed exist and computing it exactly for positive integers \(m \leqslant 13\). Our next lemma shows that L(m) is in fact the reciprocal of this quantity.

Lemma 3.4

For each positive integer m, it holds that

(4)

Proof

Let k and v be integers such that \(1 \leqslant k \leqslant v\) and \(C(v,k,2) \leqslant m\). Suppose that \(D=(X,{\mathcal {B}})\) is a k-uniform covering design with \(|{\mathcal {B}}|=m\) and that w is the normalised weighting of X such that \(w(x)=\frac{1}{v}\) for each \(x \in X\). Then

So \(L(D) \leqslant \min \{\frac{k}{v}:k,v \in {\mathbb {Z}}, 1 \leqslant k \leqslant v \text { and } C(v,k,2) \leqslant m\}\) by definition.

We now prove the opposite inequality. We do not retain any notation from the first part of the proof. By Lemma 3.1(b), there is a covering design \(D=(X,{\mathcal {B}})\) with \(|{\mathcal {B}}|=m\) and a rational normalised weighting w of X such that \(L(D,w) = L(m)\). Let v be the least positive integer such that \(v \times w(x)\) is an integer for each \(x \in X\). Let \(X'\) and \({\mathcal {B}}'\) be obtained from X and \({\mathcal {B}}\) by replacing, in X and in each \(B \in {\mathcal {B}}\), each point x with \(v \times w(x)\) new points. Let \(w'\) be the normalised weighting of \(X'\) such that \(w'(x)=\frac{1}{v}\) for each \(x \in X'\). It is not difficult to see that \(D'=(X',{\mathcal {B}}')\) is a covering design with \(|X'|=v\) and \(|{\mathcal {B}}'|=m\) and that \(L(D',w')=L(D,w)\). Let k be the size of a largest block in \({\mathcal {B}}'\) and let \(D''=(X,{\mathcal {B}}'')\) be a k-uniform covering design where \({\mathcal {B}}''\) is obtained from \({\mathcal {B}}'\) by replacing each \(B \in {\mathcal {B}}\) with a superset of B of size k. Note that \(C(v,k,2) \leqslant m\) by the existence of \(D''\). Then

Thus \(L(m) \geqslant \min \{\frac{k}{v}:k,v \in {\mathbb {Z}}, 1 \leqslant k \leqslant v \text { and } C(v,k,2) \leqslant m\}\). This completes the proof. \(\square \)

Via Lemma 3.4, the aforementioned computations of Mills in [14] immediately give us the values of L(m) listed in the following table. These values have also been computed independently by Kelly (Private communication, 2017) in the course of the research that led to [10].

figure a

4 The cases \(\varvec{m=s^2+s}\) and \(\varvec{m=s^2+s+1}\)

In [10] it was shown, for a positive integer s, that \(L(s^2+s+1)=\frac{s+1}{s^2+s-1}\) if a projective plane of order s exists. Also, \(L(s^2+s)=\frac{1}{s}\) if an affine plane of order s exists. In this section we show that the converse of the first of these statements holds and establish a partial converse for the second. Furthermore, we give a characterisation of the covering designs that can achieve these bounds.

The following two simple lemmas will be of use to us throughout this section and the next. We say that a weighting is positive if it never assigns a weight of 0.

Lemma 4.1

If \(D=(X,{\mathcal {B}})\) is a covering design and w is a normalised weighting of X, then

$$\begin{aligned}|{\mathcal {B}}|\,L(D,w) \geqslant \sum _{x \in X}r_xw(x).\end{aligned}$$

Proof

This follows immediately from the observations that \(\sum _{B \in {\mathcal {B}}}w(B) = \sum _{x \in X}r_xw(x)\) and that each of the m blocks in \({\mathcal {B}}\) has weight at most L(Dw). \(\square \)

Lemma 4.2

Let \(D=(X,{\mathcal {B}})\) be a covering design with \(|{\mathcal {B}}|=m\), and let w be a normalised weighting of X such that \(L(D,w)<1\). For each \(x \in X\) we have \(r_x \geqslant 2\) and

(5)

Furthermore, if w is positive, then we have equality in (5) only if \(w(B)=L(D,w)\) for each \(B \in {\mathcal {B}}_x\) and each \(y \in X \setminus \{x\}\) is in exactly one block in \({\mathcal {B}}_x\).

Proof

Let \(x \in X\). We have \(r_x \geqslant 2\) because otherwise, since D is a covering, the unique block containing x would be X and hence L(Dw) would be 1. Now the points in \(X \setminus \{x\}\) have total weight \(1-w(x)\) and each is in at least one block containing x. Hence,

$$\begin{aligned} \sum _{B \in {\mathcal {B}}_x}w(B) \geqslant 1-w(x)+r_xw(x) = 1+(r_x-1)w(x). \end{aligned}$$
(6)

So, because \(w(B) \leqslant L(D,w)\) for each \(B \in {\mathcal {B}}_x\), we have

(7)

Rearranging this gives (5).

Furthermore, to have equality in (5), we must have equality throughout (7) and hence in (6). Equality in the first inequality in (7) implies that \(w(B)=L(D,w)\) for each \(B \in {\mathcal {B}}_x\). If w is positive, equality in (6) implies that each \(y \in X \setminus \{x\}\) is in exactly one block in \({\mathcal {B}}_x\). \(\square \)

For a covering design \(D=(X,{\mathcal {B}})\) and a point x in X, we say the covering design \(D'=(X',{\mathcal {B}}')\) is obtained from D by removing x if \(X'=X \setminus \{x\}\) and \({\mathcal {B}}'=\{B \setminus \{x\}: B \in {\mathcal {B}}\}\) (note that \(|{\mathcal {B}}'|=|{\mathcal {B}}|\) and \({\mathcal {B}}'\) contains each block in \({\mathcal {B}}\setminus {\mathcal {B}}_x\)). If a covering design \(D''\) is obtained from D by iteratively applying this operation some number of times (including zero) we say it is obtained from D by removing points. A point x in a covering design D is said to be duplicated if there is another point in X that incident with exactly the same set of blocks. Recalling the definition of a simple hypergraph, a covering design has no duplicated points if and only if its dual is simple.

Remark 4.3

For any covering design D and normalised weighting w of the points of D, we can obtain a design \(D'\) with no duplicated points and a normalised positive weighting \(w'\) of \(D'\) such that \(L(D',w')=L(D,w)\). This can be accomplished by first removing any points of D with weight 0 and then iteratively removing duplicated points until no more remain, at each stage transferring any weight on the removed point to another point incident with the same set of blocks.

We will use the following special case of a result of Füredi [8, Corollary 1].

Theorem 4.4

[8, Corollary 1] Let \(H=(X,\mathcal {E})\) be a simple intersecting hypergraph and define \(c=\max \{|E|-1:E \in \mathcal {E}\}\). Then \(\nu ^*(H) \leqslant c\) unless H is a projective plane of order c.

Combining this with Lemma 3.1, it is easy to show the following.

Lemma 4.5

Let c be a positive integer and let D be a covering design with no duplicated points such that each point occurs in at most \(c+1\) blocks. If \(L(D) < \frac{1}{c}\), then D is a projective plane of order c.

Proof

Suppose that \(L(D) < \frac{1}{c}\). By our hypotheses, the dual H of D is a simple intersecting hypergraph in which each edge has size at most \(c+1\). Also \(\nu ^*(H)>c\) by Lemma 3.1(a) because \(L(D) < \frac{1}{c}\). So by Theorem 4.4, H is a projective plane of order c. The dual of a finite projective plane is another finite projective plane of the same order, so D is also a projective plane of order c. \(\square \)

We can now prove a result which establishes Theorem 1.4(a) and also characterises the covering designs that can achieve the bound.

Lemma 4.6

Let s be a positive integer, let \(D=(X,{\mathcal {B}})\) be covering design with \(s^2+s+1\) blocks and no duplicated points, and let w be a positive normalised weighting of X such that \(L(D,w)=\frac{s+1}{s^2+s+1}\). Then D is a projective plane of order s.

Proof

Suppose that \(D=(X,{\mathcal {B}})\) is a covering design with \(|{\mathcal {B}}|=s^2+s+1\) and no duplicated points, and that w is a positive normalised weighting of the points in X such that \(L(D,w)=\frac{s+1}{s^2+s+1}\). So \(L(D)=\frac{s+1}{s^2+s+1}\) by Theorem 1.3. We must have \(r_x \geqslant s+1\) for each \(x \in X\) because otherwise, for some \(x \in X\), we would have \(r_x\, L(D,w) < 1\) and hence \(w(x) < 0\) by Lemma 4.2. So, since \(L(D,w)=\frac{s+1}{s^2+s+1}\) and w is positive, it must be the case that \(r_x = s+1\) for each \(x \in X\) by Lemma 4.1. Thus, by Lemma 4.5, D is a projective plane of order s because \(L(D,w)<\frac{1}{s}\). \(\square \)

We now turn our attention to proving Theorem 1.4(b). In this case we will make use of a result due to Bierbrauer, Marcugini and Pambianco [2, Corollary 1] that extends Theorem 4.4.

Theorem 4.7

[2, Corollary 1] Let c be a positive integer and let \(H=(X,\mathcal {E})\) be a simple intersecting hypergraph such that \(\nu ^*(H) = c\) and \(|E| \leqslant c+1\) for each \(E \in \mathcal {E}\). Then either \((X,\mathcal {E})\) is an almost projective plane of order c or there is a subset \(\mathcal {E}'\) of \(\mathcal {E}\) such that the dual of the hypergraph \((X,\mathcal {E}')\) is an affine plane of order c.

Lemma 4.8

Let c be a positive integer and let D be a covering design with no duplicated points such that each point occurs in at most \(c+1\) blocks. If \(L(D) = \frac{1}{c}\), then either D is an almost projective plane of order c or an affine plane of order c can be obtained from D by removing points.

Proof

Suppose that \(L(D) = \frac{1}{c}\). By our hypotheses, the dual H of D is a simple intersecting hypergraph in which each edge has size at most \(c+1\). Also \(\nu ^*(H)=c\) by Lemma 3.1(a) because \(L(D) = \frac{1}{c}\). So by Theorem 4.7, the dual of an affine plane of order c can be obtained from H by removing edges or H is an almost projective plane of order c. In the former case we immediately have that an affine plane of order c can be obtained from D by removing points. In the latter case, it follows from our definition of a finite almost projective plane that its dual is another finite almost projective plane of the same order, so D is also an almost projective plane or order c. \(\square \)

The next lemma establishes Theorem 1.4(b) and also characterises the covering designs that can achieve the bound. It also shows that \(L(m) > \frac{1}{s}\) for positive integers m and s with \(m < s^2+s\), which will be useful in our proof of Theorem 1.5 in Section 5.

Lemma 4.9

Let \(s \geqslant 2\) be an integer and let \(D=(X,{\mathcal {B}})\) be a covering design such that \(L(D) \leqslant \frac{1}{s}\). Then \(|{\mathcal {B}}| \geqslant s^2+s\). Furthermore, if \(|{\mathcal {B}}|=s^2+s\), D has no duplicated points and w is a positive normalised weighting of X such that \(L(D,w) = \frac{1}{s}\), then either an affine plane of order s can be obtained from D by removing points or D is an almost projective plane of order s.

Proof

Suppose w is a normalised weighting of X such that \(L(D,w) \leqslant \frac{1}{s}\). Let \(X^*={\{x \in X:w(x)>0\}}\). For each \(x \in X^*\), we must have \(r_x \geqslant s+1\) (otherwise we would have \(w(x) \leqslant 0\), using Lemma 4.2 and \(L(D,w) \leqslant \frac{1}{s}\)). Thus, since \(L(D,w) \leqslant \frac{1}{s}\), Lemma 4.1 implies that

(8)

and it follows that \(|{\mathcal {B}}| \geqslant s^2+s\).

Now further suppose that \(|{\mathcal {B}}|=s^2+s\), D has no duplicated points and w is positive. Then \(X^*=X\) and we also must have equality throughout (8). It follows that \(L(D,w)=\frac{1}{s}\) and \(r_x = s+1\) for each \(x \in X\). So \(L(D)=\frac{1}{s}\) by Theorem 1.3. By Lemma 4.8, either an affine plane of order s can be obtained from D by removing points or D is an almost projective plane of order s. \(\square \)

It is now not difficult to prove Theorem 1.4 from Lemmas 4.6 and 4.9.

Proof of Theorem 1.4

We first prove part (a). If there exists a projective plane of order s, then \(L(s^2+s+1)=\frac{s+1}{s^2+s+1}\) by Theorem 1.3. Conversely, suppose that \(L(s^2+s+1)=\frac{s+1}{s^2+s+1}\). Then there is a covering design \(D=(X,{\mathcal {B}})\) with \(|{\mathcal {B}}|=s^2+s+1\) and a normalised weighting w of X such that \(L(D,w)=\frac{s+1}{s^2+s+1}\). By Remark 4.3, by (if necessary) removing points from D, we can obtain a covering design \(D'=(X',{\mathcal {B}}')\) with no duplicated points and a normalised positive weighting \(w'\) of \(D'\) such that \(L(D',w')=\frac{s+1}{s^2+s+1}\). Then, by Lemma 4.6, \(D'\) is a projective plane of order s. Furthermore, if \(|X|=s^2+s+1\), then, noting that we also have \(|X'|=s^2+s+1\) since \(D'\) is a projective plane of order s, it must be the case that \(X=X'\) and \(D=D'\).

Now we prove part (b). If there exists an affine plane of order s, then \(L(s^2+s)=\frac{1}{s}\) by Theorem 1.3. Also, if there exists an almost projective plane of order s, then assigning each of its points weight \(\frac{1}{s^2+s}\) shows that \(L(s^2+s)=\frac{1}{s}\) in view of the lower bound from Theorem 1.3. Conversely, suppose that \(L(s^2+s)=\frac{1}{s}\). Then there is a covering design \(D=(X,{\mathcal {B}})\) with \(|{\mathcal {B}}|=s^2+s\) and a normalised weighting w of X such that \(L(D,w)=\frac{1}{s}\). As in the proof of (a), by (if necessary) removing points from D, we can obtain a covering design \(D'=(X',{\mathcal {B}}')\) with no duplicated points and a normalised positive weighting \(w'\) of \(D'\) such that \(L(D',w')=\frac{1}{s}\). Then, by Lemma 4.9, either an affine plane of order s can be obtained from \(D'\) by removing points or \(D'\) is an almost projective plane of order s. So either an affine plane of order s or an almost projective plane of order s exists. Furthermore, if \(|X|=s^2\), then, noting that we also have \(|X'|=s^2\) since \(D'\) is an affine plane of order s, it must be the case that \(X=X'\) and \(D=D'\). \(\square \)

We noted in the introduction that almost projective planes of orders 2 and 3 exist but no example having order greater than 3 has been found (see [2, 3]). The following example uses an almost projective plane of order 3. It emphasises that there do exist covering designs D that have \(s^2+s\) blocks and \(L(D)=\frac{1}{s}\), but which are not affine planes.

Example 4.10

Consider the covering design \(D=(X,{\mathcal {B}})\) where \(X={\mathbb {Z}}_{12}\) and \({\mathcal {B}}=\{\{i,1+i,4+i,6+i\}:i \in {\mathbb {Z}}_{12}\}\). Then \(|X|=|{\mathcal {B}}|=12\) and \(r_x=4\) for each \(x \in X\), so D is an almost projective plane of order 3. Because \(L(m)=\frac{1}{3}\) by Theorem 1.3, assigning each point weight \(\frac{1}{12}\) shows that \(L(D)=\frac{1}{3}\).

Bruck and Ryser [5] famously showed that projective planes of certain orders cannot exist.

Theorem 4.11

[5] If a projective plane of order s with \(s \equiv 1,2 \pmod {4}\) exists, then s is a sum of two squares.

Blokhuis, Jungnickel and Schmidt [3, Proposition 1.2] used results of Bose and Connor [4] to prove a similar result for almost projective planes.

Theorem 4.12

[3] Suppose an almost projective plane of order s exists and let \(m = \left( {\begin{array}{c}s+1\\ 2\end{array}}\right) \). When \(s \equiv 0,3 \pmod {4}\), we have that \(s+1\) is a square and, further, if \(m \equiv 2 \pmod {4}\), then \(s-1\) is a sum of two squares. When \(s \equiv 1,2 \pmod {4}\), then \(s-1\) is a square, and there are integers x, y and z such that

$$\begin{aligned}(s+1)x^2 + (-1)^{m(m-1)/2}\, 2y^2 = z^2.\end{aligned}$$

For a given positive integer s, the consequences of Theorem 1.4 vary according to the state of knowledge about whether a projective plane of order s exists.

  1. 1.

    If s is a power of a prime, a projective plane of order s exists and so we have \(L(s^2+s)=\frac{1}{s}\) and \(L(s^2+s+1)=\frac{s+1}{s^2+s+1}\).

  2. 2.

    If it is not known whether a projective plane of order s exists, we cannot determine whether either of \(L(s^2+s)=\frac{1}{s}\) or \(L(s^2+s+1)=\frac{s+1}{s^2+s+1}\) holds since no almost projective plane of order greater than 3 is known.

  3. 3.

    If Theorem 4.11 rules out the existence of a projective plane of order s, then Theorem 4.12 also establishes the nonexistence of an almost projective plane of order s (any order allowed to exist by Theorem 4.12 is \(i^2+1\) or \(i^2-1\) for some i and hence either is a sum of two squares or is congruent to 0 or 3 modulo 4). Thus, we have that \(L(s^2+s) \ne \frac{1}{s}\) and \(L(s^2+s+1) \ne \frac{s+1}{s^2+s+1}\) in these situations.

  4. 4.

    The nonexistence of a projective plane of order 10 has been established using heavy computation [13]. However an almost projective plane of order 10 is not ruled out by Theorem 4.12 (note that \((x,y,z)=(1,1,3)\) is a solution to \(11x^2-2y^2=z^2\)). So, if \(s=10\), we have \(L(s^2+s+1) \ne \frac{s+1}{s^2+s+1}\) but we do not know whether \(L(s^2+s) = \frac{1}{s}\) or not.

5 An improved bound on the data limit

Our main goal in this section is to prove Theorem 1.5. However, we first note that Theorem 1.3 is already enough to give the asymptotic behaviour of the data limit.

Corollary 5.1

As \(m \rightarrow \infty \), we have \(L(m) = (1+o(1))m^{-1/2}\).

Proof

Let m be a positive integer. Since \( \sqrt{m}-1 \leqslant \lfloor \sqrt{m} \rfloor \leqslant \sqrt{m}\), it follows from Theorem 1.3 that \(L(m) \geqslant m^{-1/2}\). We will complete the proof by showing that \(L(m) \leqslant (1+o(1))m^{-1/2}\). Let \(q=\lfloor \sqrt{m} - \frac{1}{2} \rfloor \). Provided that m is sufficiently large, by the main result of [1], there is a prime p such that \(q-q^{0.525} \leqslant p \leqslant q\). So there is an affine plane of order p and \(L(p^2+p)=\frac{1}{p}\) by Theorem 1.3. Thus, because \(p \leqslant q \leqslant \sqrt{m} - \frac{1}{2}\), we have \(p^2+p \leqslant m\) and hence \(L(m) \leqslant L(p^2+p)=\frac{1}{p}\). Now \(q \geqslant \sqrt{m} - \frac{3}{2}\) and hence

$$\begin{aligned}p \geqslant \sqrt{m} - \tfrac{3}{2} - \left( \sqrt{m} - \tfrac{3}{2}\right) ^{0.525} \geqslant \sqrt{m} - m^{0.2625} - \tfrac{3}{2}.\end{aligned}$$

Thus

\(\square \)

We now turn our attention to proving Theorem 1.5. Throughout this section we define \(F:[2,\infty ) \rightarrow {\mathbb {R}}\) to be the function given by

where s is the unique positive integer such that \(s^2-s+1< m \leqslant s^2+s+1\). The right hand side of (2) is exactly F(m). Note that \(F(m)=1\) for \(m \in \{2,3\}\). We establish some other basic properties of F.

Lemma 5.2

  1. (a)

    For each positive integer s, \(F(s^2+s)=\frac{1}{s}\) and \(F(s^2+s+1)=\frac{s+1}{s^2+s+1}\).

  2. (b)

    F is continuous and monotonically decreasing on \([3,\infty )\).

  3. (c)

    For each integer \(m \geqslant 3\), F(m) is at least the bound of Theorem 1.3, with equality if and only if \(m \in \{s^2+s,s^2+s+1\}\) for some positive integer s.

Proof

Part (a) follows by making the appropriate substitutions and simplifying. For each integer \(s \geqslant 2\), F is continuous on the interval \((s^2-s+1,s^2+s+1)\) with derivative

which is negative. Thus, (b) follows by observing that, for each positive integer s, F is also continuous at \(s^2+s+1\). For (c), let \(m \geqslant 3\) be an integer and let s be the positive integer such that \(s^2-s+1 < m \leqslant s^2+s+1\). The claims of equality follow from (a). If \(s^2 \leqslant m < s^2+s\), then the bound of Theorem 1.3 is \(\frac{1}{s}\) and \(F(m) > F(s^2+s)=\frac{1}{s}\) using (a) and (b). If \(s^2-s+1< m < s^2\), then the bound of Theorem 1.3 is \(\frac{s}{m}\) and simplification shows that \(F(m)>\frac{s}{m}\) if and only if the radical in F(m) is greater than \(s(2s^2-2m+s-1)+1\). Squaring both sides of this inequality, simplifying and cancelling a common factor of \(4(s^2+s-1)\) we see that in turn this holds if and only if

$$\begin{aligned} -m^2 + (2s^2+1)m-4s(s^3+1) > 0. \end{aligned}$$
(9)

The left hand side of (9) is a concave quadratic function of m with roots at \(s^2-s+1\) and \(s^2+s\). Thus it is positive for all m satisfying \(s^2-s+1< m < s^2\) and we have the desired result. \(\square \)

Figure 1 gives a visual comparison of the bounds of Theorems 1.3 and 1.5 for small values of m.

Fig. 1
figure 1

A comparison of the bounds of Theorems 1.3 and 1.5 and, where applicable, known exact values for \(3 \leqslant m \leqslant 21\). For visual clarity we treat each of the two bounds as a function of a real variable m. Note that neither axis starts at 0

We prove the main part of Theorem 1.5 in the following lemma.

Lemma 5.3

For each integer \(m \geqslant 2\) we have \(L(m) \geqslant F(m)\).

Proof

Let s be the positive integer such that \(s^2-s+1< m \leqslant s^2+s+1\). For brevity, let \(\ell =L(m)\). By Lemma 3.3 there is a covering design \(D=(X,{\mathcal {B}})\) with \(|X|=|{\mathcal {B}}|=m\) and a normalised weighting of X such that \(L(D,w)=\ell \). If \(m = 2\), then \(\ell =F(m)=1\). If \(m=3\), then \(\ell =F(m)=\frac{2}{3}\) by Theorem 1.3. So we may assume that \(m \geqslant 4\) and hence that \(s \geqslant 2\). If \(m \in \{s^2+s,s^2+s+1\}\), then the result follows by Theorem 1.3 and Lemma 5.2(a). So we may assume that \(m < s^2+s\) and hence, by the contrapositive of the first part of Lemma 4.9, that \(\ell >\frac{1}{s}\). By Lemma 5.2(a) and (b) we have that \(F(m) < F(s^2-s) =\frac{1}{s-1}\). So we may suppose that \(\ell <\frac{1}{s-1}\) for otherwise we are done. Thus we have \(r_x \geqslant s\) for each \(x \in X\), for otherwise we would have a contradiction to \(w(x) \geqslant 0\) using Lemma 4.2.

For each \(i \in \{s,\ldots ,m\}\), let \(X_i=\{x \in X:r_x=i\}\) and \(y_i=\sum _{x \in X_i}w(x)\). Note that \(\{X_s,\ldots ,X_m\}\) is a partition of X. Since w is normalised we have

$$\begin{aligned} \sum _{i=s}^m y_i = 1. \end{aligned}$$
(10)

By Lemma 4.2, for each \(i \in \{s,\ldots ,m\}\), we have \(y_i \leqslant \frac{i\ell -1}{i-1}|X_i|\) and hence \(\frac{i-1}{i\ell -1}\,y_i \leqslant |X_i|\). Thus, because \(\sum _{i=s}^m|X_i|=m\), we have

(11)

Furthermore, by Lemma 4.1,

$$\begin{aligned} m\ell \geqslant \sum _{i=s}^m iy_i. \end{aligned}$$
(12)

We will now effectively bound \(\sum _{i=s}^m iy_i\) below by the optimal value of the linear program that minimises \(\sum _{i=s}^m iy_i\) subject to the constraints (10) and (11). Take \(\alpha \) times equality (10) and subtract \(\beta \) times inequality (11) where \(\alpha =\frac{(s^2-s-1)\ell +1}{1-\ell }\) and \(\beta =\frac{(s\ell -1)(s\ell +\ell -1)}{1-\ell }\) are positive. After simplification, this yields

(13)

Note that our choice of \(\alpha \) and \(\beta \) was the unique one under which the resulting inequality would have s as the coefficient of \(y_s\) and \(s+1\) as the coefficient of \(y_{s+1}\). Clearly, \(c_i \geqslant 0\) for each \(i \in \{s,\ldots ,m\}\) and so from (12) and (13) we can deduce that \(m\ell \) is at least the right hand side of (13). The resulting inequality simplifies to

$$\begin{aligned} m(s^2+s-1)\ell ^2-(s^2+2ms-s-1)\ell + m-1 \geqslant 0 \end{aligned}$$
(14)

which we can view as a quadratic function of \(\ell \) with two real roots. We saw earlier that \(\ell > \frac{1}{s}\) and we now observe that substituting \(\ell =\frac{1}{s}\) into (14) yields

which is nonpositive since \(m \leqslant s^2+s\). Thus, because \(\ell > \frac{1}{s}\), we have that \(\ell \) must be at least the larger of two roots of (14). This larger root is exactly F(m). \(\square \)

We now proceed to prove the rest of Theorem 1.5 by re-examining the proof of Lemma 5.3, analysing how equality in the bound could arise. A near pencil of order m is a linear space with m points and m blocks such that one block has size \(m-1\) and every other block has size 2.

Lemma 5.4

For each integer \(m \geqslant 2\), if \(L(m)=F(m)\), then \(m=4\) or \(m \in \{s^2+s,s^2+s+1\}\) for some positive integer s.

Proof

For brevity, let \(\ell =L(m)\). Suppose that \(\ell =F(m)\) and that \(m \notin \{s^2+s,s^2+s+1\}\) for any positive integer s. We must show that \(m=4\). By Lemma 3.3 there is a covering design \(D=(X,{\mathcal {B}})\) with \(|X|=|{\mathcal {B}}|=m\) and a normalised weighting w of X such that \(L(D,w)=\ell \). Let s be the positive integer such that \(s^2-s+1 < m \leqslant s^2+s+1\) and note \(m < s^2+s\) by our supposition. Thus \(\ell = F(m) > \frac{1}{s}\) by Lemma 5.2(a) and (b). For each \(i \in \{s,\ldots ,m\}\), let \(X_i=\{x \in X:r_x=i\}\) and \(y_i=\sum _{x \in X_i}w(x)\).

Following the proof of Lemma 5.3, we have \(r_x \geqslant s\) for each \(x \in X\). Further, because \(\ell =F(m)\), we must have equality in (14) and hence we must have equality in both (12) and (13), and the right side of (12) must equal the left side of (13). Note that, for \(c_i\) as defined in (13), we have \(c_i = 0\) for \(i \in \{s,s+1\}\) and \(c_i>0\) for \(i \in \{s+2,\ldots ,m\}\). Thus equality between the right side of (12) and the left side of (13) gives \(\sum _{i=s}^m (i-c_i)y_i=\sum _{i=s}^m iy_i\) which implies that \(y_i=0\) for \(i \in \{s+2,\ldots ,m\}\). Since (13) is obtained from an equality by subtracting a positive multiple of inequality (11), equality in (13) implies equality in (11). This in turn implies that \(y_i=\frac{i\ell -1}{i-1}|X_i|\) for each \(i \in \{s,\ldots ,m\}\) (note that \(\frac{i\ell -1}{i-1}\) is positive since \(\ell > \frac{1}{s}\)). Hence \(|X_i|=0\) for each \(i \in \{s+2,\ldots ,m\}\) and \(w(x)=\frac{i\ell -1}{i-1}\) for all \(x \in X_i\) and \(i \in \{s,s+1\}\). It follows that w is positive and we have equality in (5) for each \(x \in X\). So, by applying Lemma 4.2 to each \(x \in X\), we see that any two points in X occur together in exactly one block and so \((X,{\mathcal {B}})\) is a linear space with \(|X|=|{\mathcal {B}}|=m\). The De Bruijn-Erdős theorem in incidence geometry [6] states that such a space must be either a finite projective plane or a near pencil.

Now D cannot be a finite projective plane because \(m \ne s^2+s+1\), and hence D is a near-pencil. So \(r_y=m-1\) for some \(y \in X\) and \(r_x=2\) for each \(x \in X \setminus \{y\}\). However \(r_x \in \{s,s+1\}\) for all \(x \in X\) because we have seen that \(|X_i|=0\) for each \(i \in \{s+2,\ldots ,m\}\). So it must be the case that \(s=2\) and \(m=4\). \(\square \)

Recall that we saw in Example 1.2 that we do indeed have \(L(4)=F(4)=\frac{3}{5}\) and that this is achieved by an appropriately weighted near pencil of order 4.

Proof of Theorem 1.5

This follows from Lemmas 5.3 and 5.4. \(\square \)

6 Conclusion

Our knowledge of L(b) when b is the number of blocks in a finite projective or affine plane, together with the fact that L(m) is nondecreasing in m, gives us naive lower and upper bounds on L(m) for all m. Theorem 1.5 provides a substantial improvement on these naive lower bounds. However, we have no general improvement on the naive upper bound given by \(L(m) \leqslant \frac{s+1}{s^2+s+1}\) where s is the greatest prime power such that \(s^2+s+1 \leqslant m\). Of course, known upper bounds on covering numbers provide upper bounds on L(m) via Lemma 3.4, and we have discussed how balanced incomplete block designs, transversal designs, and finite projective Hjelmslev planes can provide upper bounds. However, we believe the lack of more systematic upper bounds still warrants attention.

Results on the existence of almost projective planes would impact our results in Theorem 1.4 and would be of independent interest. For our purposes, we could omit the mention of almost projective planes from Theorem 1.4(b) if it were shown that the orders for which almost projective planes exist formed a subset of those for which projective planes exist. Given that no almost projective plane of order greater than 3 is known, this may well be true. Proving it may not be a simple matter, however.