Continuous integration failed to detect bug in solver Hackage benchmarks #9495

grayjay · 2023-12-06T02:12:55Z

#6447 added a short run of the solver Hackage benchmark to CI to test that it can run without errors. However, I noticed a bug where the benchmark fails to run when the cabal config files have not been initialized (#9494). This issue is for checking whether CI is still correctly testing the benchmark.

Mikolaj · 2023-12-06T07:15:13Z

For a start, does the test run when testing cabal locally according to the relevant instructions?

grayjay · 2023-12-06T19:20:02Z

@Mikolaj Which instructions are you referring to? CONTRIBUTING.md doesn't mention the Hackage benchmark, though it probably should. The benchmark should be able to run locally after #9494.

grayjay · 2023-12-06T22:47:32Z

#9500 (8541514) showed that CI still passes when the benchmark calls die at the start.

grayjay · 2023-12-07T00:35:18Z

c332032 showed that CI fails when the benchmark doesn't compile.

EDIT: The step "Validate build" of the job "Validate ubuntu-latest ghc-9.2.8" failed, though all other "Validate" jobs were cancelled.


grayjay · 2023-12-07T03:42:37Z

5ca936b showed that CI passes when the unit tests have a failure. The three CI runs mean that CI builds the Hackage benchmark but doesn't correctly run it or its unit tests.

grayjay · 2023-12-07T04:28:14Z

I'm not sure how to debug the CI next. It looks like validate.sh still attempts to run the benchmarks when the --solver-benchmarks flag is specified:

cabal/validate.sh

Lines 210 to 211 in a1cbd89

    
    --solver-benchmarks)
        BENCHMARKS=true

cabal/validate.sh

Line 269 in a1cbd89

    
    if $BENCHMARKS; then STEPS="$STEPS solver-benchmarks-tests solver-benchmarks-run"; fi

cabal/validate.sh

Lines 464 to 477 in a1cbd89

    
    step_solver_benchmarks_tests() {
        print_header "solver-benchmarks: test"
        CMD="$($CABALLISTBIN solver-benchmarks:test:unit-tests)"
        (cd Cabal && timed $CMD) || exit 1
    }

    step_solver_benchmarks_run() {
        print_header "solver-benchmarks: run"
        SOLVEPKG=Chart-diagrams
        CMD="$($CABALLISTBIN solver-benchmarks:exe:hackage-benchmark) --cabal1=$CABAL --cabal2=$($CABALLISTBIN cabal-install:exe:cabal) --trials=5 --packages=$SOLVEPKG --print-trials"
        (cd Cabal && timed $CMD) || exit 1
    }

The --solver-benchmarks flag is used in two places, in .docker/validate-8.8.4.dockerfile

cabal/.docker/validate-8.8.4.dockerfile

Line 76 in a1cbd89

    
    RUN     sh ./validate.sh --doctest --solver-benchmarks --complete-hackage -w ghc-8.8.4 -v

cabal/cabal-dev-scripts/src/GenValidateDockerfile.hs

Line 44 in a1cbd89

    
    , pair "8.8.4"  $ Z "ghc-8.8.4"  "8.8.4-bionic"  False True  False True  "--doctest --solver-benchmarks --complete-hackage"

and .github/workflows/validate.yml

cabal/.github/workflows/validate.yml

Line 39 in a1cbd89

    GHC_FOR_SOLVER_BENCHMARKS: '9.2.8'

cabal/.github/workflows/validate.yml

Lines 112 to 114 in a1cbd89

    
    if [[ ${{ matrix.ghc }} == ${{ env.GHC_FOR_SOLVER_BENCHMARKS }} ]]; then
      FLAGS="$FLAGS --solver-benchmarks"
    fi
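
To make the flag handling quoted above easier to follow, here is a reduced model of it. This is only a sketch, not the real validate.sh: the function name `steps_for_flags` and the default `build` step are assumptions made for illustration, and every flag other than `--solver-benchmarks` is ignored.

```shell
#!/usr/bin/env bash
# Sketch only: a reduced model of the flag handling, not the real validate.sh.
set -euo pipefail

# Hypothetical helper: print the list of steps implied by the given flags.
steps_for_flags() {
  local benchmarks=false
  local steps="build"            # assumed default step, for illustration only
  while [ $# -gt 0 ]; do
    case "$1" in
      --solver-benchmarks) benchmarks=true ;;
      *) ;;                      # all other flags are ignored in this sketch
    esac
    shift
  done
  # Mirrors the quoted line: benchmark steps are appended only when the flag was given.
  if $benchmarks; then
    steps="$steps solver-benchmarks-tests solver-benchmarks-run"
  fi
  echo "$steps"
}

steps_for_flags --doctest --solver-benchmarks
steps_for_flags --doctest
```

In this model the benchmark steps only ever reach the step list through the flag, so they run only if whatever consumes that list actually executes every entry in it.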

Mikolaj · 2023-12-07T08:58:04Z

> EDIT: The step "Validate build" of the job "Validate ubuntu-latest ghc-9.2.8" failed, though all other "Validate" jobs were cancelled.

That's normal. When one job fails, others get cancelled. So this case is fine.

Mikolaj · 2023-12-07T09:21:51Z

Good detective work. Yes, I confirm in the raw log from https://github.com/haskell/cabal/actions/runs/7109262010/job/19402640597?pr=9494 that the benchmarks are built for GHC 9.2.8. However, they are neither tested nor run, because validate.sh is no longer run as a whole to execute all the steps. Instead, validate.yml decides which steps to execute and calls validate.sh separately for each one. Unfortunately, it seems somebody did not cover the benchmark test and run steps in validate.yml (nor the time-summary step that should come after; no idea if it's still useful).

So this looks like an omission in validate.yml and it's hard to detect because of the duplication with validate.sh, where the benchmark steps are not omitted. I think we should just add the relevant steps to validate.yml, running them conditionally, probably just with the if [[ ${{ matrix.ghc }} == ${{ env.GHC_FOR_SOLVER_BENCHMARKS }} ]]; condition.
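
A minimal sketch of that suggestion, written as plain bash rather than as workflow YAML. The helper `run_step` is a hypothetical placeholder for however the workflow invokes a single validate.sh step, and the GHC version is passed as an ordinary argument instead of coming from `${{ matrix.ghc }}`.

```shell
#!/usr/bin/env bash
# Sketch only: models the proposed condition, not the actual validate.yml change.
set -euo pipefail

GHC_FOR_SOLVER_BENCHMARKS='9.2.8'

# Hypothetical placeholder: in CI this would invoke validate.sh for one step.
run_step() {
  echo "running step: $1"
}

# Run the benchmark test and run steps only for the GHC version
# selected for the solver benchmarks; do nothing for other versions.
benchmark_steps_for() {
  local matrix_ghc="$1"
  if [[ "$matrix_ghc" == "$GHC_FOR_SOLVER_BENCHMARKS" ]]; then
    run_step solver-benchmarks-tests
    run_step solver-benchmarks-run
  fi
}

benchmark_steps_for "9.2.8"
benchmark_steps_for "9.4.8"
```

The point of the explicit calls is that a step missing from the workflow is skipped silently, so each benchmark step has to be listed there even though validate.sh already knows about it.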


grayjay · 2024-01-18T21:47:23Z

@Mikolaj Thanks for the advice! I made the change that you suggested in #9625, and now CI fails with the error about a missing config file from #9494.


grayjay · 2024-01-27T07:13:54Z

I tried introducing a failure into the benchmark unit tests now that #9625 has been merged (5655b89), and it caused the CI to fail, as expected.

Mikolaj · 2024-01-27T09:09:47Z

> I tried introducing a failure into the benchmark unit tests now that #9625 has been merged (5655b89), and it caused the CI to fail, as expected.

Amazing! Success!

grayjay added the labels "type: bug", "cabal-install: solver", and "continuous-integration" on Dec 6, 2023

grayjay added a commit to grayjay/cabal that referenced this issue Dec 6, 2023

Test Solver Hackage benchmark (issue haskell#9495)

8541514

grayjay added a commit to grayjay/cabal that referenced this issue Dec 6, 2023

Test compile error in Solver Hackage benchmark (issue haskell#9495)

c332032

grayjay added a commit to grayjay/cabal that referenced this issue Dec 7, 2023

Revert "Test compile error in Solver Hackage benchmark (issue haskell…

e390e0c

…#9495)" This reverts commit c332032.

grayjay added a commit to grayjay/cabal that referenced this issue Dec 7, 2023

Revert "Test Solver Hackage benchmark (issue haskell#9495)"

ef1aaf0

This reverts commit 8541514.

grayjay added a commit to grayjay/cabal that referenced this issue Dec 7, 2023

Test unit test error in Solver Hackage benchmark (issue haskell#9495)

5ca936b

grayjay added a commit to grayjay/cabal that referenced this issue Jan 18, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

4bb7854

grayjay added a commit to grayjay/cabal that referenced this issue Jan 18, 2024

fixup! Add solver Hackage benchmarks to GitHub Actions (fixes haskell…

2764f57

…#9495)

grayjay added a commit to grayjay/cabal that referenced this issue Jan 22, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

47e2f96

grayjay added a commit to grayjay/cabal that referenced this issue Jan 22, 2024

fixup! Add solver Hackage benchmarks to GitHub Actions (fixes haskell…

ea76037

…#9495)

grayjay added a commit to grayjay/cabal that referenced this issue Jan 22, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

1d2d6b7

grayjay added a commit to grayjay/cabal that referenced this issue Jan 22, 2024

fixup! Add solver Hackage benchmarks to GitHub Actions (fixes haskell…

ce54499

…#9495)

grayjay added a commit to grayjay/cabal that referenced this issue Jan 24, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

31476b8

grayjay added a commit to grayjay/cabal that referenced this issue Jan 24, 2024

fixup! Add solver Hackage benchmarks to GitHub Actions (fixes haskell…

35ac6e5

…#9495)

grayjay added a commit to grayjay/cabal that referenced this issue Jan 24, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

fcfe445

mergify bot closed this as completed in c0dcbde Jan 26, 2024

mergify bot added a commit that referenced this issue Jan 26, 2024

Merge pull request #9625 from grayjay/issue-9495

0800125

Add solver Hackage benchmarks to GitHub Actions (fixes #9495)

grayjay added a commit to grayjay/cabal that referenced this issue Jan 27, 2024

Test Solver Hackage benchmark (issue haskell#9495)

3339094

grayjay added a commit to grayjay/cabal that referenced this issue Jan 27, 2024

Test compile error in Solver Hackage benchmark (issue haskell#9495)

0c38b62

grayjay added a commit to grayjay/cabal that referenced this issue Jan 27, 2024

Revert "Test compile error in Solver Hackage benchmark (issue haskell…

d2631b3

…#9495)" This reverts commit c332032.

grayjay added a commit to grayjay/cabal that referenced this issue Jan 27, 2024

Revert "Test Solver Hackage benchmark (issue haskell#9495)"

0b34d81

This reverts commit 8541514.

grayjay added a commit to grayjay/cabal that referenced this issue Jan 27, 2024

Test unit test error in Solver Hackage benchmark (issue haskell#9495)

5655b89

erikd pushed a commit to erikd/cabal that referenced this issue Apr 22, 2024

Add solver Hackage benchmarks to GitHub Actions (fixes haskell#9495)

21f43e4
