[FIX] Restructuring #303

elephaint · 2024-10-25T15:07:44Z

Put source files in src folder
Restructure methods adding Sparse methods to docs
Update CI to improve speed of tests
Update model performance tests to test more models
Docs only update on release or workflow dispatch, not on push to main
Move Numba functions to utils, perform eager compilation and micro optimization of lasso function.
Use double dtype everywhere; reconcilers are sensitive to low(er) precision, for now unify to double and maybe in the future turn everything to single

Lasso (ERM-reg / ERM-reg_bu) micro optimization before and after (the method is still slow as **** ):

review-notebook-app · 2024-10-25T15:07:50Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

.github/release-drafter.yml

.github/workflows/python-publish.yml

action_files/test_models/src/data.py

action_files/test_models/src/evaluation.py

action_files/test_models/src/models.py

hierarchicalforecast/core.py

hierarchicalforecast/utils.py

elephaint · 2024-10-30T14:48:44Z

Test for lasso equivalence of methods (removed it from the code to keep it clean):

#| hide
# Lasso test old vs new
@njit
def _lasso_old(X: np.ndarray, y: np.ndarray, 
          lambda_reg: float, max_iters: int = 1_000,
          tol: float = 1e-4):
    # lasso cyclic coordinate descent
    n, feats = X.shape
    norms = (X ** 2).sum(axis=0)
    beta = np.zeros(feats, dtype=np.float32)
    beta_changes = np.zeros(feats, dtype=np.float32)
    residuals = y.copy()

    for it in range(max_iters):
        for i, betai in enumerate(beta):
            # is feature is close to zero, we 
            # continue to the next.
            # in this case is optimal betai= 0
            # print(beta)

            if abs(norms[i]) < 1e-8:
                continue
            xi = X[:, i]
            #we calculate the normalized derivative
            rho = betai + xi.flatten().dot(residuals) / norms[i] #(norms[i] + 1e-3)
            #soft threshold
            beta[i] = np.sign(rho) * max(np.abs(rho) - lambda_reg * n / norms[i], 0.)#(norms[i] + 1e-3), 0.)
            beta_changes[i] = np.abs(betai - beta[i])
            if beta[i] != betai:
                residuals += (betai - beta[i]) * xi
        if max(beta_changes) < tol:
            break
    #print(it)
    return beta


S = np.array([[1., 1., 1., 1.],
              [1., 1., 0., 0.],
              [0., 0., 1., 1.],
              [1., 0., 0., 0.],
              [0., 1., 0., 0.],
              [0., 0., 1., 0.],
              [0., 0., 0., 1.]])
h = 2
_y = np.array([10., 5., 4., 2., 1.])
y_bottom = np.vstack([i * _y for i in range(1, 5)])
y_hat_bottom_insample = np.roll(y_bottom, 1)
y_hat_bottom = np.vstack([i * np.ones(h) for i in range(1, 5)])
idx_bottom = [3, 4, 5, 6]

y_hat=S @ y_hat_bottom
y_insample=S @ y_bottom
idx_bottom=idx_bottom

n_hiers, n_bottom = S.shape

for method in ["reg", "reg_bu"]:
    for lambda_reg in [None, 0.1, 10, 1000]:
        for with_nans in [True, False]:
            y_hat_bottom_insample = np.roll(y_bottom, 1)
            if with_nans:
                y_hat_bottom_insample[:, 0] = np.nan
            y_hat_insample=S @ y_hat_bottom_insample
            y_insample=S @ y_bottom

            nan_idx = np.isnan(y_hat_insample).any(axis=0)
            y_insample = y_insample[:, ~nan_idx]
            y_hat_insample = y_hat_insample[:, ~nan_idx]
            h = min(y_hat.shape[1], y_hat_insample.shape[1])
            y_hat_insample = y_hat_insample[:, -h:] # shape (h, n_hiers)
            y_insample = y_insample[:, -h:]

            if method == 'reg':
                X = np.kron(S, y_hat_insample.T)
                z = y_hat_insample.reshape(-1)

                if lambda_reg is None:
                    lambda_reg = np.max(np.abs(X.T.dot(z)))
                else:
                    lambda_reg = lambda_reg            

                beta_old = _lasso_old(X, z, lambda_reg)
                beta_new = _lasso(X, z, lambda_reg, max_iters=1000, tol=1e-4) 
                np.testing.assert_allclose(beta_new, beta_old, atol=1e-5, rtol=1e-3)

            if method == 'reg_bu':
                X = np.kron(S, y_hat_insample.T)
                Pbu = np.zeros_like(S)
                Pbu[idx_bottom] = S[idx_bottom]
                z = y_hat_insample.reshape(-1) - X @ Pbu.reshape(-1)
                    
                if lambda_reg is None:
                    lambda_reg = np.max(np.abs(X.T.dot(z)))
                else:
                    lambda_reg = lambda_reg             

                beta_old = _lasso_old(X, z, lambda_reg)
                beta_new = _lasso(X, z, lambda_reg, max_iters=1000, tol=1e-4) 
                np.testing.assert_allclose(beta_new, beta_old, atol=1e-5, rtol=1e-3)

jmoralez

Non blocking comments.

action_files/test_models/src/data.py

action_files/test_models/src/evaluation.py

action_files/test_models/src/models.py

restructure_actions

0884699

elephaint added 18 commits October 25, 2024 17:14

add_deps_not_sync_them

2009daf

clean_up_config_cci

da3f873

fix_cci

d760119

eager_numba_and_floats_to_double_and_micro_optimization_erm

61a3a58

cci_error

6522f71

cci_error

e0370a9

change_flags

9b378a3

correct_erm

0f3fcca

test_larger_for_cci

1835d27

restruct_docs

682330a

imgs_back_to_examples

9e737d8

add_sparse_to_docs

d026575

test_benchmark_artifacts

6f55f86

test_benchmark_artifacts

2dc2a85

test_benchmark_artifacts

7bfcdf2

strict_hierarchy_test

aefc582

add_tests

e94446e

fix_tests

efa2a77

elephaint requested a review from jmoralez October 29, 2024 22:41

elephaint and others added 2 commits October 29, 2024 23:49

fix_timing

b91cd7f

Merge branch 'main' into docs-restructuring

9da8923

jmoralez reviewed Oct 29, 2024

View reviewed changes

jmoralez reviewed Oct 30, 2024

View reviewed changes

hierarchicalforecast/utils.py Show resolved Hide resolved

elephaint added 5 commits October 30, 2024 11:41

jose_comments

34d581b

lasso_test

27c5ab2

add_lasso_test

d81896e

add_model_evaluation_and_readme_fixes

1bc4992

remove_lasso_test

93b5ea0

elephaint requested a review from jmoralez October 30, 2024 14:50

jmoralez approved these changes Oct 30, 2024

View reviewed changes

action_files/test_models/src/data.py Show resolved Hide resolved

action_files/test_models/src/evaluation.py Show resolved Hide resolved

action_files/test_models/src/models.py Show resolved Hide resolved

elephaint merged commit bf944f2 into main Oct 30, 2024
17 checks passed

elephaint deleted the docs-restructuring branch October 30, 2024 18:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FIX] Restructuring #303

[FIX] Restructuring #303

elephaint commented Oct 25, 2024 •

edited

Loading

review-notebook-app bot commented Oct 25, 2024

elephaint commented Oct 30, 2024

jmoralez left a comment

[FIX] Restructuring #303

[FIX] Restructuring #303

Conversation

elephaint commented Oct 25, 2024 • edited Loading

review-notebook-app bot commented Oct 25, 2024

elephaint commented Oct 30, 2024

jmoralez left a comment

Choose a reason for hiding this comment

elephaint commented Oct 25, 2024 •

edited

Loading