hanging while reading in gromacs trr trajectory #3789

dkonstan · 2022-08-24T20:26:44Z

Expected behavior

quick loading of GROMACS universe with trr trajectory into MDAnalysis

Actual behavior

it hangs indefinitely in some internal process and is resistant to Ctrl^C so the process is something very internal probably. DCD format etc works fine with same topology.

Code to reproduce the behavior

uni = MDAnalysis.Universe("topol.top", "some_gromacs_trajectory.trr", topology_format="ITP")

Current version of MDAnalysis

Which version are you using? 2.2.0
Which version of Python (python -V)? 3.9
Which operating system? Linux (cluster I think CentOS)

The text was updated successfully, but these errors were encountered:

richardjgowers · 2022-08-24T22:20:25Z

@dkonstan how large is the trajectory in question? There is an index of frames built on first load (for trr format) and this might be taking too long. Does a much smaller trajectory file work as expected?

dkonstan · 2022-08-24T22:47:58Z

You are right, it is a VERY large file (~800 GB). However, these huge trajectories load fast in other formats. I don't see why TRR needs to have a routine that loops over the whole thing. Is it something specific about the TRR format? Right now, I am simply converting TRR to DCD using mdtraj (I'm sorry to use a competitor hehe) and then using MDAnalysis straightforwardly, but it would be nice not to have to do this.

…

On Wed, Aug 24, 2022 at 6:20 PM Richard Gowers ***@***.***> wrote: @dkonstan <https://github.com/dkonstan> how large is the trajectory in question? There is an index of frames built on first load (for trr format) and this might be taking too long. Does a much smaller trajectory file work as expected? — Reply to this email directly, view it on GitHub <#3789 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ALLHQP2X5AHY5V5Q2KT6X6TV22N3HANCNFSM57QVGQIA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

richardjgowers · 2022-08-24T22:59:50Z

The TRR format doesn't allow seeking (jumping to a random frame) natively, so we build an index to be able to seek. Obviously this isn't working well for you. It would be nice if we could make this index building lazy (note frame offsets as they are read) or be able to disable it entirely for super large files like this.

orbeckst · 2022-08-25T14:47:10Z

Why do we have to build the index for TRR with iterating? TRR has a fixed frame size so we should be able to compute the index.

(XTC is a different issue.)

@dkonstan in general it’s important for MDA to have the index because that is the only feasible approach for us to guarantee fast random frame access for all trajectory formats while not reading the whole trajectory into memory.

orbeckst · 2022-08-25T14:48:43Z

@richardjgowers disabling the index will likely break fundamental assumptions about how we handle trajectories.

richardjgowers · 2022-08-25T14:53:04Z

@orbeckst maybe TRR isn't compressed but I think it still allows different strides in position force and velocity reporting... but I think yes maybe there's an analytical solution to seeking.

orbeckst · 2022-08-25T15:18:35Z

Hm, yes, you are right. You never know what you find in a TRR step.

jbarnoud · 2022-08-26T06:16:30Z

It would be nice if we could make this index building lazy (note frame offsets as they are read)

I just opened #3793 in that direction.

dkonstan · 2022-10-11T07:42:42Z

Got it, thank you! Yes that would be great.

…

On Wed, Aug 24, 2022 at 7:00 PM Richard Gowers ***@***.***> wrote: The TRR format doesn't allow seeking (jumping to a random frame) natively, so we build an index to be able to seek. Obviously this isn't working well for you. It would be nice if we could make this index building lazy (note frame offsets as they are read) or be able to disable it entirely for super large files like this. — Reply to this email directly, view it on GitHub <#3789 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ALLHQP5QKSOWK5AS6KJC2ILV22SPDANCNFSM57QVGQIA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

orbeckst · 2023-03-11T17:14:30Z

Would it be useful to have some indication that index building is happening, such as a progressbar or at least a message?

hmacdope added the Format-Gromacs label Aug 24, 2022

jbarnoud mentioned this issue Aug 26, 2022

Lazy index building for XTC and TRR tajectories #3793

Open

orbeckst added the performance label Aug 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hanging while reading in gromacs trr trajectory #3789

hanging while reading in gromacs trr trajectory #3789

dkonstan commented Aug 24, 2022

richardjgowers commented Aug 24, 2022

dkonstan commented Aug 24, 2022 via email

richardjgowers commented Aug 24, 2022

orbeckst commented Aug 25, 2022

orbeckst commented Aug 25, 2022

richardjgowers commented Aug 25, 2022

orbeckst commented Aug 25, 2022

jbarnoud commented Aug 26, 2022

dkonstan commented Oct 11, 2022 via email

orbeckst commented Mar 11, 2023

hanging while reading in gromacs trr trajectory #3789

hanging while reading in gromacs trr trajectory #3789

Comments

dkonstan commented Aug 24, 2022

Expected behavior

Actual behavior

Code to reproduce the behavior

Current version of MDAnalysis

richardjgowers commented Aug 24, 2022

dkonstan commented Aug 24, 2022 via email

richardjgowers commented Aug 24, 2022

orbeckst commented Aug 25, 2022

orbeckst commented Aug 25, 2022

richardjgowers commented Aug 25, 2022

orbeckst commented Aug 25, 2022

jbarnoud commented Aug 26, 2022

dkonstan commented Oct 11, 2022 via email

orbeckst commented Mar 11, 2023