-
Notifications
You must be signed in to change notification settings - Fork 663
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when loading multiple large DCD trajectories #4039
Comments
I will confirm but this may be related to a change in DCD reading in #3882. |
I don't think this is a DCD corruption issue, but instead a problem with an int overflow in the DCD code on Windows machines, or more specifically the issue outlined here: #3075 @vdeshchenya could you comment on how many frames you were expected to read from this DCD? |
@IAlibay IIRC did recently get this on my Linux box but will confirm. |
ah I missed that OP was on macos, that's a lot more worrisome. |
I can reproduce on macOS 10.15.7, Python 3.10, current mda develop
|
And it has nothing to do with MemoryReader
|
I installed at 2d98ad5 and I can read the trajectory (see below). This is a regression introduced after 2d98ad5 (possibly #3882) but I haven't run In [6]: u_long = mda.Universe(PSF, 'test.dcd', 'test.dcd')
In [7]: for ts in mda.log.ProgressBar(u_long.trajectory):
...: pass
...:
100%|█████████████████████████████████| 117600/117600 [00:14<00:00, 8292.91it/s] create_test.pyRun once to create the test trajectory: import MDAnalysis as mda
from MDAnalysis.tests.datafiles import PSF, DCD
u = mda.Universe(PSF, DCD)
# creating a big trajectory (2.2 GB)
ag = u.select_atoms("all")
with mda.Writer('test.dcd', ag.n_atoms) as w:
for _ in mda.log.ProgressBar(range(600)):
for ts in u.trajectory:
w.write(ag) (I haven't tested if the trajectory size (2.2G) or the number of frames is the issue.) read trajectoryShould go through without raising an exception import MDAnalysis as mda
from MDAnalysisTests.datafiles import PSF
u_long = mda.Universe(PSF, 'test.dcd', 'test.dcd')
for ts in mda.log.ProgressBar(u_long.trajectory):
pass |
Instead of
|
@hmacdope do you know how to fix this? This is a pretty serious regression, given the prevalence of DCD files. |
Ill have a look, I think I know what the issue is. |
@hmacdope is this just affecting develop or a previous released version too? |
Yep @IAlibay >= |
happens, do we want to do a bugfix release? |
I can probably look at this tomorrow. If it’s just int to ssize_t it’ll be
simple enough
…On Tue, Feb 28, 2023 at 22:02, Irfan Alibay ***@***.***> wrote:
Yep @IAlibay <https://github.com/IAlibay> >= 2.4.0 is affected. 😞
happens, do we want to do a bugfix release?
—
Reply to this email directly, view it on GitHub
<#4039 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ACGSGB3NTLC3O73UQIYREKLWZZYV3ANCNFSM6AAAAAAVHD2FZU>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
The test is a bit annoying because we need to generate the trajectory but… eventually having a “big traj” test for all readers might be useful. |
Can probably be done programmatically for all readers in the Reader test base class thingo. |
this line will overflow with
|
use fio_size_t for all variables related to filesizes fixes for #4039
@sef43 thanks for the tip! much appreciated. I think @richardjgowers has sorted this in his fix (#4048). Many thanks for cleaning up after me. |
Fixes #4039 * Fixes DCD seeking for large (2Gb+) files.
Fixes #4039 * Fixes DCD seeking for large (2Gb+) files.
I am having the same problem. Mdanalysis version is 2.8. |
@qlearn-code can you please open a new issue? Describe your problem and you can mention this issue. Until we determine that your problem has the same cause as this one (which was supposedly fixed) we want to discuss it independently as this helps us to better keep track of problems. Thank you. |
Got it! Just resubmit the issue under bug report. Thanks!
From: Oliver Beckstein ***@***.***>
Sent: Thursday, January 2, 2025 3:43 PM
To: MDAnalysis/mdanalysis ***@***.***>
Cc: Liu, Qian ***@***.***>; Mention ***@***.***>
Subject: Re: [MDAnalysis/mdanalysis] Error when loading multiple large DCD trajectories (Issue #4039)
@ qlearn-code can you please open a new issue? Describe your problem and you can mention this issue. Until we determine that your problem has the same cause as this one (which was supposedly fixed) we want to discuss it independently as this
ZjQcmQRYFpfptBannerStart
This Message Is From an External Sender
This message came from outside your organization.
ZjQcmQRYFpfptBannerEnd
@qlearn-code<https://urldefense.us/v3/__https:/github.com/qlearn-code__;!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIBYPL7OQ$> can you please open a new issue?
Describe your problem and you can mention this issue.
Until we determine that your problem has the same cause as this one (which was supposedly fixed) we want to discuss it independently as this helps us to better keep track of problems. Thank you.
—
Reply to this email directly, view it on GitHub<https://urldefense.us/v3/__https:/github.com/MDAnalysis/mdanalysis/issues/4039*issuecomment-2568416581__;Iw!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIEgGGvUe$>, or unsubscribe<https://urldefense.us/v3/__https:/github.com/notifications/unsubscribe-auth/BJ5WFGKXT6ABI75BN3ATPPD2IWXF3AVCNFSM6AAAAABUQTENFCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNRYGQYTMNJYGE__;!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIM5NxstZ$>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
|
Issue no is #4879
From: Liu, Qian
Sent: Thursday, January 2, 2025 4:47 PM
To: MDAnalysis/mdanalysis ***@***.***>; MDAnalysis/mdanalysis ***@***.***>
Cc: Mention ***@***.***>
Subject: RE: [MDAnalysis/mdanalysis] Error when loading multiple large DCD trajectories (Issue #4039)
Got it! Just resubmit the issue under bug report. Thanks!
From: Oliver Beckstein ***@***.******@***.***>>
Sent: Thursday, January 2, 2025 3:43 PM
To: MDAnalysis/mdanalysis ***@***.******@***.***>>
Cc: Liu, Qian ***@***.******@***.***>>; Mention ***@***.******@***.***>>
Subject: Re: [MDAnalysis/mdanalysis] Error when loading multiple large DCD trajectories (Issue #4039)
@ qlearn-code can you please open a new issue? Describe your problem and you can mention this issue. Until we determine that your problem has the same cause as this one (which was supposedly fixed) we want to discuss it independently as this
ZjQcmQRYFpfptBannerStart
This Message Is From an External Sender
This message came from outside your organization.
ZjQcmQRYFpfptBannerEnd
@qlearn-code<https://urldefense.us/v3/__https:/github.com/qlearn-code__;!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIBYPL7OQ$> can you please open a new issue?
Describe your problem and you can mention this issue.
Until we determine that your problem has the same cause as this one (which was supposedly fixed) we want to discuss it independently as this helps us to better keep track of problems. Thank you.
—
Reply to this email directly, view it on GitHub<https://urldefense.us/v3/__https:/github.com/MDAnalysis/mdanalysis/issues/4039*issuecomment-2568416581__;Iw!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIEgGGvUe$>, or unsubscribe<https://urldefense.us/v3/__https:/github.com/notifications/unsubscribe-auth/BJ5WFGKXT6ABI75BN3ATPPD2IWXF3AVCNFSM6AAAAABUQTENFCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNRYGQYTMNJYGE__;!!G_uCfscf7eWS!akT8v7kYadT2CcrRw8XsPtWzdTi3kWZ4P_QTmv72XN_l9w2wjb78bTW-yvELIqdopUnFXbcwCG_kbMDoIM5NxstZ$>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
|
Expected behavior
I create a Universe with several large trajectory files (size > 2 GB) and use the transfer_to_memory function to load them.
Actual behavior
The transfer_to_memory function fails after loading 53458/117600 frames with the error "OSError: DCD seek failed with DCD error=Normal EOF".
(53458 frames take up about 2 GB of memory)
Code to reproduce the behavior
Current version of MDAnalysis
python -c "import MDAnalysis as mda; print(mda.__version__)"
): 2.4.2python -V
)?: Python 3.9.15The text was updated successfully, but these errors were encountered: