New Nikodym set constructions over finite fields
I have uploaded to the arXiv my paper “New Nikodym set constructions over finite fields“. This is a spinoff of my previous project with Bogdan Georgiev, Javier Gómez–Serrano, and Adam Zsolt Wagner that I recently posted about. In that project we experimented with using AlphaEvolve (and other tools, such as DeepThink and AlphaProof) to explore various mathematical problems which were connected somehow to an optimization problem. For one of these — the finite field Nikodym set problem — these experiments led (by a somewhat convoluted process) to an improved asymptotic construction of such sets, the details of which are written up (by myself rather than by AI tools) in this paper.
Let
be a finite field of some order
(which must be a prime or a power of a prime), and let
be a fixed dimension. A Nikodym set in
is a subset
of
with the property that for every point
, there exists a line
passing through
such that all points of
other than
lie in
. Such sets are close cousins of Kakeya sets (which contain a line in every direction); indeed, roughly speaking, applying a random projective transformation to a Nikodym set will yield (most of) a Kakeya set. As a consequence, any lower bound on Kakeya sets implies a similar bound on Nikodym sets; in particular, one has a lower bound
, coming from a similar bound on Kakeya sets due to Bukh and Chao using the polynomial method.For Kakeya sets, Bukh and Chao showed this bound to be sharp up to the lower order error
; but for Nikodym sets it is conjectured that in fact such sets should asymptotically have full density, in the sense that
) by Guo, Kopparty, and Sudan by combining the polynomial method with the theory of linear codes. But in other cases this conjecture remains open in three and higher dimensions.In our experiments we focused on the opposite problem of constructing Nikodym sets of size as small as possible. In the plane
, constructions of size when
is a perfect square were constructed by Blokhuis et al, again using the theory of blocking sets; by taking Cartesian products of such sets, one can also make similar constructions in higher dimensions, again assuming
is a perfect square. Apart from this, though, there are few such constructions in the literature.
We set AlphaEvolve to try to optimize the three dimensional problem with a variable field size
(which we took to be prime for simplicity), with the intent to get this tool to come up with a construction that worked asymptotically for large
, rather than just for any fixed value of
. After some rounds of evolution, it arrived at a construction which empirically had size about
. Inspecting the code, it turned out that AlphaEvolve had constructed a Nikodym set
by (mostly) removing eight low-degree algebraic surfaces (all of the form
for various
). We used the tool DeepThink to confirm the Nikodym property and to verify the construction, and then asked it to generalize the method. By removing many more than eight surfaces, and using some heuristic arguments based on the Chebotarev density theorem, DeepThink claimed a construction of size formed by removing several higher degree surfaces, but it acknowledged that the arguments were non-rigorous.
The arguments can be sketched here as follows. Let
be a random surface of degree
, and let
be a point in
which does not lie in
. A random line through
then meets
in a number of points, which is basically the set of zeroes in
of a random polynomial of degree
. The (function field analogue of the) Chebotarev density theorem predicts that the probability that this polynomial has no roots in
is about
, where
elements that are derangements (no fixed points). So, if one removes
random surfaces of degrees
, the probability that a random line avoids all of these surfaces is about
. If this product is significantly greater than
, then the law of large numbers (and concentration of measure) then predicts (with high probability) that out of the
lines through
, at least one will avoid the removed surfaces, thus giving (most of) a Nikodym set. The Lang-Weil estimate predicts that each surface has cardinality about
, so this should give a Nikodym set of size about
.DeepThink took the degrees
to be large, so that the derangement probabilities
were close to
. This led it to predict that
could be taken to be as large as
, leading to the claimed bound (2). However, on inspecting this argument we realized that these moderately high degree surfaces were effectively acting as random sets, so one could dramatically simplify DeepThink’s argument by simply taking
to be a completely random set of the desired cardinality (2), in which case the verification of the Nikodym set property (with positive probability) could be established by a standard Chernoff bound-type argument (actually, I ended up using Bennett’s inequality rather than Chernoff’s inequality, but this is a minor technical detail).
On the other hand, the derangement probabilities
oscillate around
, and in fact are as large as
when
. This suggested that one could do better than the purely random construction if one only removed quadratic surfaces instead of higher degree surfaces, and heuristically predicted the improvement However, our experiments with both AlphaEvolve and DeepThink to try to make this idea work either empirically, heuristically, or rigorously were all unsuccessful! Eventually Deepthink discovered the problem: random quadratic polynomials often had two or zero roots (depending on whether the discriminant was a non-zero quadratic residue, or a nonresidue), but would only very rarely have just one root (the discriminant would have to vanish). As a consequence, if
happened to lie on one of the removed quadratic surfaces
, it was extremely likely that most lines through
would intersect
in a further point; only the small minority of lines that were tangent to
and
would avoid this. None of the AI tools we tried were able to overcome this obstacle.
However, I realized that one could repair the construction by adding back a small random portion of the removed quadratic surfaces, to allow for a non-zero number of lines through
to stay inside the putative Nikodym set even when
was in one of the surfaces
, and the line was not tangent to
. Pursuing this idea, and performing various standard probabilistic calculations and projective changes of variable, the problem essentially reduced to the following: given
random quadratic polynomials in the plane
, is it true that these polynomials simultaneously take quadratic residue values for
of the points in that plane? Heuristically this should be true even for
close to
. However, it proved difficult to accurately control this simultaneous quadratic residue event; standard algebraic geometry tools such as the Weil conjectures seemed to require some vanishing of étale cohomology groups in order to obtain adequate error terms, and this was not something I was eager to try to work out. However, by exploiting projective symmetry (and the
-transitive nature of the projective linear group), I could get satisfactory control of such intersections as long as
was a little bit larger than
rather than
. This gave an intermediate construction of size
We also looked at the two-dimensional case to see how well AlphaEvolve could recover known results, in the case that
was a perfect square. It was able to come up with a construction that was slightly worse than the best known construction, in which one removed a large number of parabolas from the plane; after manually optimizing the construction we were able to recover the known bound (1). This final construction is somewhat similar to existing constructions (it has a strong resemblance to a standard construction formed by taking the complement of a Hermitian unital), but is still technically a new construction, so we have also added it to this paper.
Terence Tao's Blog
- Terence Tao's profile
- 230 followers

