Speed up the MBAR calculation #357
Conversation
Codecov Report
All modified and coverable lines are covered by tests ✅

@@ Coverage Diff @@
##           master     #357   +/-   ##
=======================================
  Coverage   98.82%   98.83%
=======================================
  Files          28       28
  Lines        1875     1890    +15
  Branches      405      409     +4
=======================================
+ Hits         1853     1868    +15
  Misses          2        2
  Partials       20       20
Neat idea and cool that it gives a real performance boost.
Definitely needs documentation!
.github/workflows/ci.yaml
 with:
-  token: ${{ secrets.CODECOV_TOKEN }}
+  token: ${{ secrets.CODECOV }}
Is this a change due to a merge or did you smuggle it into this PR??
As you could see, the CI failed due to Codecov token issues, so I tried to see if I could fix it.
-result = estimator_fit(sample)
+result = estimator_fit.fit(sample)
+if estimator == "MBAR":
+    estimator_fit.initial_f_k = result.delta_f_.iloc[0, :]
I didn't fully follow the code path, but how is delta_f_ already initialized with the BAR results?
An MBAR calculation is done first (result = estimator_fit.fit(sample)), which populates result.delta_f_ with the current MBAR results. These MBAR results are then propagated back as the initial guess for the next calculation.
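The benefit of seeding each fit with the previous converged result can be sketched with a toy self-consistent iteration. This is purely illustrative (a generic contraction-mapping fixed point, not alchemlyb's actual MBAR solver); the point is just that a warm start near the solution converges in far fewer iterations than a cold start:

```python
import numpy as np

rng = np.random.default_rng(0)
K = 8                                   # toy stand-in for K lambda states
M = rng.normal(size=(K, K)) / K         # small matrix -> contraction mapping
c = rng.normal(size=K)

def solve(f0, tol=1e-12, max_iter=10_000):
    """Iterate the toy fixed point f <- c + 0.3*tanh(M @ f) until the
    estimate stops changing; return the solution and the iteration count."""
    f = np.asarray(f0, dtype=float).copy()
    for n in range(1, max_iter + 1):
        f_new = c + 0.3 * np.tanh(M @ f)
        if np.max(np.abs(f_new - f)) < tol:
            return f_new, n
        f = f_new
    raise RuntimeError("iteration did not converge")

# Cold start from zeros (the old MBAR default behaviour).
f_cold, n_cold = solve(np.zeros(K))
# Warm start from a slightly perturbed previous solution, mimicking
# refitting after the data set has grown a little.
f_warm, n_warm = solve(f_cold + 1e-6)
```

With the warm start the iteration begins much closer to the fixed point, so `n_warm` is well below `n_cold` while both converge to the same answer.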
Some Codecov checks are still failing due to the token issue, but otherwise this seems fine. When this PR is merged, do you mind if I do a new 2.2.1 release? I think the speed increase is quite significant.
Let's make a 2.3.0 release (see my suggested changes; you can just accept the whole bunch) and carefully document the API change (new default).
Please add a test for MBAR(initial_f_k=None)
for the old behavior, given that the default changed.
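The new default's three accepted values (an array, the string "BAR", or None for the old behavior) can be sketched with a minimal stand-in class. This is a hypothetical illustration of the dispatch logic, not alchemlyb's real MBAR class, and `bar_estimate` here stands in for whatever BAR result the estimator would compute internally:

```python
import numpy as np

class ToyMBAR:
    """Hypothetical sketch of the changed default: initial_f_k="BAR"
    warm-starts from a BAR-like estimate, initial_f_k=None restores the
    old zero initial guess, and an array is passed through unchanged."""

    def __init__(self, initial_f_k="BAR"):
        self.initial_f_k = initial_f_k

    def _initial_guess(self, n_states, bar_estimate):
        if self.initial_f_k is None:
            return np.zeros(n_states)            # old behavior: cold start
        if isinstance(self.initial_f_k, str) and self.initial_f_k == "BAR":
            return np.asarray(bar_estimate)      # new default: BAR warm start
        return np.asarray(self.initial_f_k)      # user-supplied array

bar = np.array([0.0, 1.2, 2.5])
old_style = ToyMBAR(initial_f_k=None)._initial_guess(3, bar)
new_default = ToyMBAR()._initial_guess(3, bar)
```

A test for the old behavior then only needs to assert that `initial_f_k=None` yields the zero guess while the default yields the BAR estimate.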
@@ -71,14 +80,19 @@ def __init__(
     self,
     maximum_iterations=10000,
     relative_tolerance=1.0e-7,
-    initial_f_k=None,
+    initial_f_k: np.ndarray | Literal["BAR"] | None = "BAR",
Changing the default is, technically speaking, an API change, so we can't just do it in a patch release.
The conservative approach would be to keep the original default.
However, I think we want to make this speed-up available in a release, so my suggestion is to make a 2.3 release and make it clearer that this changed; see my other comments.
Co-authored-by: Oliver Beckstein <[email protected]>
I have adopted @mrshirts's advice of using the BAR results as the initial guess for MBAR, which seems to provide a 5-fold speed-up.
I also tried to optimise the convergence detection by using the result of the previous MBAR run as the initial guess for the next MBAR run. Compared to using BAR as input, this approach provides a further speed-up.
Using BAR as the initial guess:
3.58 s ± 35.7 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
Using the previous step as the initial guess:
2.25 s ± 23.6 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
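The convergence-detection loop described above (each MBAR run on a growing data slice seeding the next) can be sketched as follows. This is a hedged outline of the pattern, not the project's actual convergence code; `fit(data, initial)` is a hypothetical callable standing in for an estimator fit that accepts an initial guess:

```python
import numpy as np

def forward_convergence(series, fit):
    """Fit an estimator on growing slices of `series`, propagating each
    converged result as the initial guess for the next, larger slice."""
    initial = None                 # first slice: no previous result yet
    results = []
    for frac in (0.25, 0.5, 0.75, 1.0):
        n = max(1, int(len(series) * frac))
        est = fit(series[:n], initial)
        results.append(est)
        initial = est              # warm-start the next slice's fit
    return results

# Dummy stand-in "estimator" for demonstration: just the running mean.
series = np.arange(8.0)
results = forward_convergence(series, lambda data, f0: float(np.mean(data)))
```

Because consecutive slices share most of their data, consecutive estimates are close, which is why the warm start pays off more than seeding every fit with BAR from scratch.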