Update values to int64 #13548

teonbrooks · 2025-12-16T18:56:29Z

Fix attempt as fixing the overflow issue in the read_raw_cnt reader. This error has manifested with numpy upgrade.

Reference issue

Fixes #13547.

What does this implement/fix?

This follows a pattern suggested in #12907 to cast the integer to int64.

larsoner · 2025-12-16T19:05:27Z

To read your file it needs a few more fixes actually... I'll push

larsoner · 2025-12-16T19:16:47Z

Definitely still something wrong here...

$ python -uic "import mne; raw = mne.io.read_raw_cnt('~/Desktop/945flankers_ready.cnt', data_format='int16').load_data(); raw.plot(annotation_regex='aaa')"
Traceback (most recent call last):
  File "<string>", line 1, in <module>
    import mne; raw = mne.io.read_raw_cnt('~/Desktop/945flankers_ready.cnt', data_format='int16').load_data(); raw.plot(annotation_regex='aaa')
                      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^
  File "<decorator-gen-190>", line 12, in load_data
  File "/home/larsoner/python/mne-python/mne/io/base.py", line 589, in load_data
    self._preload_data(True)
    ~~~~~~~~~~~~~~~~~~^^^^^^
  File "/home/larsoner/python/mne-python/mne/io/base.py", line 601, in _preload_data
    self._data = self._read_segment(data_buffer=data_buffer)
                 ~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<decorator-gen-189>", line 12, in _read_segment
  File "/home/larsoner/python/mne-python/mne/io/base.py", line 420, in _read_segment
    data = _allocate_data(data_buffer, data_shape, dtype)
  File "/home/larsoner/python/mne-python/mne/io/base.py", line 2577, in _allocate_data
    data = np.zeros(shape, dtype)
numpy._core._exceptions._ArrayMemoryError: Unable to allocate 2.06 TiB for an array with shape (66, 4294966564) and data type float64

Same error if I use data_format='int32'. If I remove the .load_data and use data_format='int32' the plot at least looks okay

So need to figure out the n_samples issue, 4294966564 samples for 66 channels is totally unreasonable for a 150MB file...

larsoner · 2025-12-16T19:29:02Z

mne/io/cnt/_utils.py

-        (n_samples,) = np.frombuffer(fid.read(4), dtype="<i4")
+        fid.seek(_NSAMPLES_OFFSET)
+        n_samples = int(np.frombuffer(fid.read(4), dtype="<u4").item())


Interestingly, this was <i4 here but <u4 in cnt.py. When we use <i4 in cnt.py we get a negative n_samples (-732)

I just reverted it back to "<i4", it should be a signed int. I thought something was wrong when I first encountered it sign-wise.

but yeah, I'm getting the same values as you with the "<i4". I guess that's why there is the later implementation mentioned below.

larsoner · 2025-12-16T19:31:31Z

@teonbrooks I'm done pushing/looking for now, I hope the changes I made help debugging a bit more. Something is wrong with n_samples here, it gets read as 4294966564 ...

teonbrooks · 2025-12-17T14:39:37Z

thanks @larsoner!

came across this post and adding it here for reference in the future
https://paulbourke.net/dataformats/eeg/

teonbrooks · 2025-12-17T14:45:20Z

@teonbrooks I'm done pushing/looking for now, I hope the changes I made help debugging a bit more. Something is wrong with n_samples here, it gets read as 4294966564 ...

according to the link above, it looks like this is not an uncommon occurrence:

Experience has shown that many (most) of the fields are not filled out correctly by the software. In particular, the best way to work out the number of samples is

it looks like n_samples should be calculated as:

nsamples = SETUP.EventTablePos - (900 + 75 * nchannels) / (2 * nchannels)

larsoner · 2025-12-17T15:15:59Z

Great, can you add some comments / links in the code for the next time we dig into this, and try the suggested fix?

teonbrooks · 2025-12-20T00:36:23Z

added a note. after trying it out and I look more closely at the code, it looks as though the n_samples logic is already there starting at https://github.com/mne-tools/mne-python/blob/main/mne/io/cnt/cnt.py#L339.

teonbrooks · 2025-12-20T00:37:41Z

I actually don't know what to do about the n_samples. it looks like the code already is trying to best handle the data without knowing the data_format and with the header not having a reliable header entry for it.

larsoner requested review from agramfort, dengemann, drammock and larsoner as code owners December 16, 2025 19:17

larsoner reviewed Dec 16, 2025

View reviewed changes

teonbrooks force-pushed the cnt-overflow-fix branch from a5c1f92 to 603dd01 Compare December 20, 2025 00:36

teonbrooks and others added 8 commits December 20, 2025 00:43

Update values to int64

69e88e6

FIX: Numbers

2f00e82

[autofix.ci] apply automated fixes

b840023

FIX: Unify

cbd3b3f

Remove code used for debugging

94bda9e

added note

5371c29

revert back to "<i4" for n_samples

79a9cf4

formatting

66c24ca

teonbrooks force-pushed the cnt-overflow-fix branch from 603dd01 to 66c24ca Compare December 20, 2025 00:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Update values to int64 #13548

Update values to int64 #13548

teonbrooks commented Dec 16, 2025

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

larsoner Dec 16, 2025

Uh oh!

teonbrooks Dec 20, 2025

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

teonbrooks commented Dec 17, 2025

Uh oh!

teonbrooks commented Dec 17, 2025

Uh oh!

larsoner commented Dec 17, 2025

Uh oh!

teonbrooks commented Dec 20, 2025

Uh oh!

teonbrooks commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Update values to int64 #13548

Are you sure you want to change the base?

Update values to int64 #13548

Conversation

teonbrooks commented Dec 16, 2025

Reference issue

What does this implement/fix?

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

larsoner Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

teonbrooks Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

larsoner commented Dec 16, 2025

Uh oh!

teonbrooks commented Dec 17, 2025

Uh oh!

teonbrooks commented Dec 17, 2025

Uh oh!

larsoner commented Dec 17, 2025

Uh oh!

teonbrooks commented Dec 20, 2025

Uh oh!

teonbrooks commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants