Fix to bootstrap of free energy surfaces, affecting timing and quantitative results #535

mrshirts · 2024-08-26T04:20:07Z

The free energy surface code was rerunning MBAR after every bootstrap resampling of the indices. The extra calls do not appear to affect the results, but they slow things down by a factor of a little less than 2x (146 vs 87 seconds for one sample run).

…changed indices, taking too long.

codecov · 2024-08-26T04:23:10Z

Codecov Report

Attention: Patch coverage is 52.94118% with 8 lines in your changes missing coverage. Please review.

Project coverage is 69.70%. Comparing base (85e034c) to head (4ed45db).


mrshirts · 2024-08-26T04:28:23Z

I will put in a couple more changes to fix the uncertainties; don't do anything with this yet . . .

mrshirts · 2024-08-26T15:58:55Z

OK, I think I got in the changes I needed to.

mikemhenry · 2024-08-26T16:35:49Z

@mrshirts when you are ready for review go ahead and add whoever you want to review this PR 😄

mrshirts · 2024-08-26T17:04:07Z

pymbar/fes.py

-                        self.u_kn[:, bootstrap_indices], self.N_k, initial_f_k=self.mbar.f_k
-                    )
-                    x_nb = x_n[bootstrap_indices]
+                # recompute MBAR.


This was unnecessary - it was running MBAR too many times. Removing it makes the run approximately 2x faster.

mrshirts

My comments on this for other people.

mrshirts · 2024-08-26T17:04:29Z

pymbar/fes.py

-                    fall[:, b] = h["f"] - h["f"][j]
-                df_i = np.std(fall, axis=1)
+                    fall[:, b] = h["f"] - h["f"][j]  # subtract out the reference bin
+                df_i = np.std(fall, ddof=1, axis=1)


Fixing the std definition: use the sample standard deviation (ddof=1) over the bootstrap replicates.
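For readers unfamiliar with the `ddof` argument, here is a minimal sketch of what the change does numerically. The array `fall` below is synthetic stand-in data (bins by bootstrap replicates), not output from pymbar; only `np.std` and its `ddof` parameter are taken from the diff.

```python
import numpy as np

# Synthetic stand-in for bootstrap results: rows are bins, columns are replicates.
rng = np.random.default_rng(0)
fall = rng.normal(loc=0.0, scale=1.0, size=(4, 50))
n_boot = fall.shape[1]

# Default ddof=0 divides the summed squared deviations by n_boot.
std_population = np.std(fall, axis=1)

# ddof=1 divides by n_boot - 1, the usual sample estimate.
std_sample = np.std(fall, ddof=1, axis=1)

# The two differ by the constant factor sqrt(n_boot / (n_boot - 1)).
ratio = std_sample / std_population
```

With few bootstrap replicates the factor is noticeable; it shrinks toward 1 as the number of replicates grows.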

mrshirts · 2024-08-26T17:04:38Z

pymbar/fes.py

@@ -1510,7 +1512,7 @@ def _get_fes_histogram(
                        fall[i, j, b] = (
                            histogram_datas[b]["f"] - histogram_datas[b]["f"].transpose()
                        )
-                dfxij_vals = np.std(fall, axis=2)
+                dfxij_vals = np.std(fall, ddof=1, axis=2)


Fixing the std definition.

mrshirts · 2024-08-26T17:04:58Z

pymbar/fes.py

-            kde = self.kde
-        kde.fit(x_n, sample_weight=self.w_n)
+            kde = self.kde  # use these new weights for the KDE
+            w_n = self.w_n


I actually can't remember if this was 100% necessary to get updated weights . . .

mrshirts · 2024-08-26T17:05:15Z

pymbar/fes.py

-                    fall[:, b] = h["f"] - h["f"][j]
-                df_i = np.std(fall, axis=1)
+                    fall[:, b] = h["f"] - h["f"][j]  # subtract out the reference bin
+                df_i = np.std(fall, ddof=1, axis=1)


Fix bootstrap std definition

mrshirts · 2024-08-26T17:05:41Z

pymbar/fes.py

@@ -1565,9 +1567,15 @@ def _get_fes_kde(
        if reference_point == "from-lowest":
            fmin = np.min(f_i)
            f_i = f_i - fmin
+            wheremin = np.argmin(


Need to find the location that this is zeroed at for the actual computation of the std.

mrshirts · 2024-08-26T17:05:49Z

pymbar/fes.py

        elif reference_point == "from-specified":
            fmin = -self.kde.score_samples(np.array(fes_reference).reshape(1, -1))
            f_i = f_i - fmin
+            wheremin = np.argmin(


Need to find the location that this is zeroed at for the actual computation of the std.

mrshirts · 2024-08-26T17:06:16Z

pymbar/fes.py

-                fall[:, b] = -self.kdes[b].score_samples(x) - fmin
-            df_i = np.std(fall, axis=1)
+                fall[:, b] = -self.kdes[b].score_samples(x)
+                fall[:, b] -= fall[wheremin, b]


Zero out at the correct location.
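A rough sketch of why the zeroing location matters for the bootstrap spread, using synthetic curves rather than real KDE output. The names `fall`, `wheremin`, and `fmin` follow the diff; the quadratic surface, the random constant offset per replicate, and the noise level are assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(-1.0, 1.0, 21)
f_i = x**2                   # stand-in for the full-data surface
fmin = np.min(f_i)
wheremin = np.argmin(f_i)    # index of the point where the surface is zeroed

n_bootstraps = 200
fall_old = np.zeros((len(x), n_bootstraps))
fall_new = np.zeros((len(x), n_bootstraps))
for b in range(n_bootstraps):
    # Hypothetical replicate: same shape, arbitrary constant offset, small noise.
    f_b = f_i + rng.normal(scale=5.0) + rng.normal(scale=0.05, size=len(x))
    fall_old[:, b] = f_b - fmin            # one shared constant for all replicates
    fall_new[:, b] = f_b - f_b[wheremin]   # each replicate's own reference value

df_old = np.std(fall_old, ddof=1, axis=1)
df_new = np.std(fall_new, ddof=1, axis=1)
```

In this toy setup, subtracting a shared constant leaves the replicate-to-replicate offset in the spread, while subtracting each replicate's own value at `wheremin` removes it and gives zero uncertainty at the reference point.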

mrshirts · 2024-08-28T14:40:23Z

Any suggestions for who else should review this, or is there anyone who could take a look? We are looking at some free energy surface problems for OpenFF, so we want to get this through.

Fixing the bootstrap indices - it was calling MBAR after each set of …

2bc1a00

…changed indices, taking too long.

mrshirts added 3 commits August 26, 2024 09:18

Fixed for standard deviations and bootstrap errors for KDE.

c1d4cf0

Fix the resets.

e9f299d

removing debugging.

d409818

mrshirts added 4 commits August 26, 2024 10:05

fix formatting for black.

42e04ab

some fixes for lint.

bc7f5b8

lint checks.

185736d

more linting.

4ed45db

mrshirts self-assigned this Aug 26, 2024

mrshirts marked this pull request as draft August 26, 2024 17:02

mrshirts requested review from Lnaden, mikemhenry and maxentile August 26, 2024 17:02


mrshirts changed the title ~~Fix to bootstrap of free energy surfaces, affecting timing~~ Fix to bootstrap of free energy surfaces, affecting timing and quantitative results Aug 26, 2024

maxentile removed their request for review August 26, 2024 17:38

mrshirts marked this pull request as ready for review August 29, 2024 19:30
