compiler: Add a utility to estimate memory usage for an operator #2659

EdCaunt · 2025-07-03T16:53:11Z

No description provided.

codecov · 2025-07-03T17:10:11Z

Codecov Report

❌ Patch coverage is 89.67972% with 29 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@dd1e6c7). Learn more about missing BASE report.
⚠️ Report is 12 commits behind head on main.

Files with missing lines	Patch %	Lines
devito/operator/operator.py	69.23%	26 Missing and 2 partials ⚠️
devito/tools/data_structures.py	94.44%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2659   +/-   ##
=======================================
  Coverage        ?   91.58%           
=======================================
  Files           ?      248           
  Lines           ?    49634           
  Branches        ?     4368           
=======================================
  Hits            ?    45459           
  Misses          ?     3458           
  Partials        ?      717

Flag	Coverage Δ
pytest-gpu-nvc-nvidiaX	`72.06% <89.32%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

devito/operator/operator.py

devito/types/sparse.py

devito/operator/operator.py

tests/test_operator.py

enwask

Property names feel kinda messy, also not a fan of the hardcoded column width but otherwise looks good

devito/operator/operator.py

devito/tools/data_structures.py

tests/test_operator.py

Copilot

Pull Request Overview

This PR adds a new memory estimation utility estimate_memory() to the Operator class that calculates memory consumption without allocating or touching data. The feature enables users to estimate memory requirements for Operators with different parameters and overrides, which is particularly useful for planning large computations that might exceed available memory.

Key Changes:

Adds estimate_memory() method to Operator class for memory consumption estimation without data allocation
Extends _arg_defaults() and _arg_values() methods across function types to support memory estimation mode
Implements comprehensive memory tracking for Functions, Arrays, and device memory mapping in ArgumentsMap

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
`devito/operator/operator.py`	Adds core `estimate_memory()` method and extensive memory tracking properties to `ArgumentsMap`
`devito/types/dense.py`	Adds `estimate_memory` parameter to avoid data allocation during estimation
`devito/types/sparse.py`	Extends sparse function types to support memory estimation mode
`devito/tools/data_structures.py`	Adds `MemoryEstimate` class for structured memory reporting
`tests/test_operator.py`	Comprehensive test suite covering various scenarios and edge cases
`.github/workflows/pytest-gpu.yml`	Includes new tests in GPU CI workflows

Comments suppressed due to low confidence (4)

devito/operator/operator.py:814

[nitpick] The estimate_memory parameter is checked in multiple places throughout the argument preparation flow. Consider extracting this logic into a separate method or using a context manager to reduce code duplication and improve maintainability.

                recompiled, src_file = self._compiler.jit_compile(self._soname, str(self))

tests/test_operator.py:133

The Function is created with a save parameter, but Function doesn't support the save parameter - this is only valid for TimeFunction. This should be TimeFunction instead of Function.

    def test_compiler_uniqueness(self):

tests/test_operator.py:196

This test relies on checking for a specific string pattern in generated C code to verify array temporary creation. This is fragile and could break if code generation changes. Consider using a more robust method to verify temporary array creation.

        ('Lt(0.1*(g1 + g2), 0.2*(g1 + g2))', 0,

devito/tools/data_structures.py

devito/operator/operator.py

JDBetteridge

Nitpick:

Is there a way to suppress unused keys. For instance, running locally I get:

MemoryEstimate(Kernel): {'host': '512 KB', 'device': '0 B'}

I didn't use any disk so the key doesn't appear. The system doesn't have a GPU, but the device key still appears.

Alternatively, and maybe this is preferable, ensure that all keys are shown to assure the user that no disk or device memory is being consumed.

EdCaunt · 2025-07-25T08:57:55Z

Alternatively, and maybe this is preferable, ensure that all keys are shown to assure the user that no disk or device memory is being consumed.

This is the intention. Moreover, disk can only be consumed in PRO (strictly speaking these are snapshots which will be moved back and forth as the various memory locations overflow), so the mapper is only enriched with snapshot information via a hook.

JDBetteridge · 2025-07-25T11:02:00Z

devito/tools/data_structures.py

+    def to_json(self, path):
+        """
+        Write the MemoryEstimate to JSON for ingestion by a scheduler.
+
+        Arguments
+        ---------
+        path: str
+            The path to which the memory estimate should be written.
+        """
+        summary = {'name': self.name, **self._dict}
+        json_object = json.dumps(summary, indent=4)
+
+        with open(path, "w") as outfile:
+            outfile.write(json_object)


devito/operator/operator.py

FabioLuporini · 2025-07-25T12:46:14Z

devito/operator/operator.py

        # Prepare to process data-carriers
        args = kwargs['args'] = ReducerMap()
+
        kwargs['metadata'] = {'language': self._language,


if we make estimate_memory part of metadata we can probably save some carrying around, both here and in the dense/sparse files

I had the estimate_memory switch in the kwargs at an earlier iteration and it made a lot of the _arg_defaults etc a fair bit messier, since they always need to know about it to avoid touching the data when performing overrides (so you had to get it from kwargs all over the place). Furthermore, I would argue that having it as an explicit kwarg reduces the likelihood that it will be accidentally overlooked when creating new subclasses in the future

why? if it's part of kwargs (or metadata) you only have to extract it in _arg_defaults at worst (maybe not even that depending on how you do things since it could be already explicit in the parameters list) , while currently it appears uselessly in all _arg_values only to be forwarded to _arg_defaults

obviously this is all a nitpicking, but since we're discussing it...

just to be clear: this is not a stopper at all for me, but I'm not convinced about your argument

Furthermore, I would argue that having it as an explicit kwarg reduces the likelihood that it will be accidentally overlooked when creating new subclasses in the future

However, this is a fair point too

If you put it in metadata, you need a whole load of:

if metadata is not None: estimate_memory = metadata.get('estimate_memory', False)

or

metadata = kwargs.get('metadata', {}) estimate_memory = metadata.get('estimate_memory', False)

in all the _arg_defaults (the form depending on the current _arg_defaults kwargs).

On top of uglifying the code (imo), it's much less obvious that the _arg_defaults for any array-carrying symbol should have special handling to avoid a data touch when merely estimating the memory (since users can and regularly will request an estimate for an operator that they don't have memory to run).

devito/operator/operator.py

FabioLuporini · 2025-07-28T07:54:20Z

devito/tools/data_structures.py

+class MemoryEstimate(frozendict):
+    """
+    An immutable mapper for a memory estimate, providing the estimated memory
+    consumption across host, device, and so forth.


and disk.

?

Strictly speaking, with PRO, you never get a 'disk' estimate, just a 'snapshot' estimate, as it would be far too much guesswork to estimate where the snaps are going to end up. Imo, figuring out if the values make sense is a user problem, and the raw data should not be obfuscated. The docstring is phrased to avoid giving too much away about what one can expect in PRO (or implying PRO features are in OSS), whilst avoiding a separate PRO class with an updated docstring.

FabioLuporini · 2025-07-28T07:55:39Z

devito/types/dense.py

            if isinstance(new, DiscreteFunction):
                # Set new values and re-derive defaults
-                values = new._arg_defaults(alias=self, metadata=metadata)
+                values = new._arg_defaults(alias=self, metadata=metadata,


going back to what I was saying earlier, for example, if estimate_memory had been part of metadata, you would need zero changes here

See my other comment, having estimate_memory in metadata makes the contents of the function messier, whilst making it less obvious that special handling should be implemented in all subclasses to prevent inadvertent OOM errors when estimating the memory consumption of an operator.

FabioLuporini · 2025-07-28T07:56:55Z

devito/types/sparse.py

        return args

-    def _arg_values(self, **kwargs):
+    def _arg_values(self, estimate_memory=False, **kwargs):


btw, for homogeneity at some point we could pass in the metadata too here

Why would it be needed though (considering my previous comments)?

review-notebook-app · 2025-08-04T12:52:15Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

EdCaunt commented Jul 4, 2025

View reviewed changes

devito/operator/operator.py Outdated Show resolved Hide resolved

EdCaunt requested review from FabioLuporini, JDBetteridge and mloubout July 9, 2025 12:15

mloubout added the API api (symbolics, types, ...) label Jul 9, 2025

JDBetteridge requested changes Jul 11, 2025

View reviewed changes

EdCaunt force-pushed the estimate-memory branch from fe13cd5 to e064d2f Compare July 15, 2025 14:34

mloubout requested changes Jul 15, 2025

View reviewed changes

enwask reviewed Jul 15, 2025

View reviewed changes

FabioLuporini reviewed Jul 16, 2025

View reviewed changes