FORM memory buffers #795

jodavies · 2026-02-05T16:42:48Z

jodavies
Feb 5, 2026
Maintainer

I would like to start some discusson on the FORM/TFORM default buffer sizes and how they relate to each other. Changes here would break some of the tests, which rely on specific buffer sizes to hit (fixed) buggy behaviour, which is a bit of a pain.

In particular, running FORM and TFORM with large MaxTermSize values causes really enormous memory allocations, which can be multiple TB. This makes it hard to debug some crashing scripts: valgrind fails to run due to failed mallocs, even if the configuration runs outside of valgrind (because FORM touches hardly any of the memory). The large allocations are primarily due to:

large + smallextension must be at least filepatches * (sortiosize + 2*maxtermsize). The manual claims that filepatches will be reduced to satisfy this, but in fact largesize is increased. This can mean that the worker large+smallextension can be larger than 1/workers times the master buffer sizes.
in TFORM, the "deferbuffer" allocation scales with threadbucketsize * maxtermsize. There is already code and commentary to reduce this somewhat, but a maxtermsize of 5120K and default threadbucketsize of 500 causes an allocation of 2 * 4.7GB per thread. Depending on the number of threads, this can easily exceed the worker large+smallextension!
in TFORM, for sufficiently large maxtermsize, the master large+smallextension starts to scale with the number of threads, since RecalcSetups enforces that this is at least (threads-1)*(1+NUMBEROFBLOCKSINSORT*MINIMUMNUMBEROFTERMS)*maxtermsize -- this buffer is used by the sortbots when performing the final sort to the output (?)

So my questions / things to experiment with are:

Can we reduce filepatches instead of increasing largesize, as described in the manual? This could lead to more stage4 sorts, but the allocations would be much smaller.
Should we more aggressively reduce the "deferbuffer" allocation for larger threadbucketsize and maxtermsize combinations? It seems unlikely that this buffer needs to be larger than the worker large+smallextension, for example.
What is the purpose of RecalcSetups? Why does it overlap (and conflict) with size relations in AllocSetups and AllocSort?
Is a 1:1 ratio between master and thread sorting buffers optimal? What if the master buffers had less space, and the worker buffers more? Such large master buffers often seems wasteful.

I will try to implement some changes, and run some benchmarks, and update this in the future.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FORM memory buffers #795

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

FORM memory buffers #795

Uh oh!

Uh oh!

jodavies Feb 5, 2026 Maintainer

Replies: 0 comments

jodavies
Feb 5, 2026
Maintainer