Add benchmarks for subresultants PRS method #94

Merged (7 commits into sympy:master, Jul 20, 2023)

Conversation

@1e9abhi1e10 (Contributor):

This PR adds benchmarks for the subresultants PRS method.
Related Issue:
#88

See sympy/sympy#25371
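
A minimal sketch of what such a benchmark can look like under asv's class-based conventions (the class name and polynomial family here are illustrative, not the actual code added in this PR; the real benchmarks live in benchmarks/polys.py and parametrize over 'expr', 'dense' and 'sparse' implementations):

```python
# Hypothetical asv benchmark sketch, not the code from this PR.
from sympy import subresultants, symbols


class TimeSubresultantsSketch:
    params = [1, 2, 3]
    param_names = ['n']

    def setup(self, n):
        x, y = symbols('x y')
        self.x = x
        # Illustrative inputs: a pair of polynomials in x sharing a
        # common factor whose size grows with n.
        self.f = (x + y + 1)**n * (x - 2)
        self.g = (x + y + 1)**n * (x + 2)

    def time_subresultants(self, n):
        # Compute the subresultant polynomial remainder sequence in x.
        subresultants(self.f, self.g, self.x)
```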

@1e9abhi1e10 (Contributor, Author):

The benchmarks for subresultants are failing due to the teardown function. I am working on it!
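
For context, asv calls setup and teardown with the same parameters as the timed method, so a signature mismatch is one common way a teardown can break a parameterized benchmark. This is a general illustration only; the thread does not say what the actual bug was:

```python
# Sketch of asv's parameterized-benchmark calling convention; this is
# a generic example, not the actual bug fixed in this PR.
class TimeExampleSketch:
    params = [[1, 2, 3], ['expr', 'dense', 'sparse']]
    param_names = ['n', 'impl']

    def setup(self, n, impl):
        pass  # build the inputs for this (n, impl) combination

    def time_op(self, n, impl):
        pass  # the operation being timed

    def teardown(self, n, impl):
        # teardown receives the same (n, impl) arguments as time_op;
        # defining it as teardown(self) would raise a TypeError.
        pass
```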

@1e9abhi1e10 (Contributor, Author) commented Jul 16, 2023:

This is what the current output looks like:

[ 42.32%] ··· ...UBRESULTANTS_LinearDenseQuadraticGCD.time_op                 ok
[ 42.32%] ··· ====== ============= ============= =============
              --                        impl                  
              ------ -----------------------------------------
               size       expr         dense         sparse   
              ====== ============= ============= =============
                1     2.07±0.01ms     541±2μs       1.42±0ms  
                2     4.62±0.01ms   3.17±0.01ms     7.01±0ms  
                3       9.93±0ms    18.1±0.04ms   31.2±0.04ms 
              ====== ============= ============= =============

[ 42.48%] ··· ...meSUBRESULTANTS_QuadraticNonMonicGCD.time_op                 ok
[ 42.48%] ··· ====== ============= ============= =============
              --                        impl                  
              ------ -----------------------------------------
               size       expr         dense         sparse   
              ====== ============= ============= =============
                1     1.24±0.01ms     1.21±0ms    7.82±0.01ms 
                2       3.78±0ms    7.28±0.01ms   18.0±0.04ms 
                3     18.8±0.05ms    157±0.2ms     248±0.3ms  
              ====== ============= ============= =============

[ 42.65%] ··· ...imeSUBRESULTANTS_SparseGCDHighDegree.time_op                 ok
[ 42.65%] ··· ====== ========== ============= =============
              --                      impl                 
              ------ --------------------------------------
               size     expr        dense         sparse   
              ====== ========== ============= =============
                1     970±4μs     253±0.4μs     497±0.9μs  
                2     1.32±0ms     1.42±0ms    2.03±0.01ms 
                3     1.84±0ms   6.59±0.02ms   7.81±0.01ms 
                5     3.20±0ms   30.8±0.02ms    34.1±0.1ms 
              ====== ========== ============= =============

[ 42.81%] ··· ...UBRESULTANTS_SparseNonMonicQuadratic.time_op                 ok
[ 42.81%] ··· ====== ========= ============= =============
              --                      impl                
              ------ -------------------------------------
               size     expr       dense         sparse   
              ====== ========= ============= =============
                1     632±1μs     255±1μs      673±0.6μs  
                2     701±2μs    598±0.9μs      1.11±0ms  
                3     759±1μs   7.33±0.02ms   8.05±0.02ms 
                5     885±2μs   20.3±0.05ms   21.2±0.02ms 
              ====== ========= ============= =============

@oscarbenjamin (Contributor):

I think this looks good, but we should reduce the total number of benchmarks that are reported for both PREM and subresultants.

You can see what it currently looks like when benchmark results are reported to a PR here:
sympy/sympy#25371 (comment)

This PR would double that. Now that we can see these examples and how the benchmarks compare, the question is: which of these different benchmarks is potentially reporting something interesting?

For GCD probably all cases are interesting (they were designed for GCD), but for PREM it does not seem that e.g. the TimePREM_SparseNonMonicQuadratic timings demonstrate anything particularly informative beyond what the TimePREM_SparseGCDHighDegree timings show.

@oscarbenjamin (Contributor):

As an aside the timings here comparing expr, dense and sparse demonstrate a common problem with the sympy benchmark suite. The expr timings suggest that it is faster than both dense and sparse in most cases (up to 20x faster for larger problems). This is almost certainly not true though: under the hood the expr version of e.g. prem uses either the dense or sparse implementation so it is necessarily slower.

The reason for this is that many SymPy operations (with expr) use a cache and the benchmarks report timings from repeating an operation many times while the cache is in operation. It means that the timings reported for expr are often only really representative of the performance of the cache rather than the actual time that it takes to compute something that is not already cached.
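
One way to reduce this effect, shown here as a sketch rather than anything adopted in this PR, is to clear SymPy's global cache in the benchmark's setup via sympy.core.cache.clear_cache:

```python
# Hypothetical benchmark that starts each timing run from a cold cache.
from sympy import prem, symbols
from sympy.core.cache import clear_cache

x, y = symbols('x y')


class TimePremUncachedSketch:
    def setup(self):
        # Clear the cache so the first timed call does real work. asv
        # still repeats the statement within a run, so later repeats may
        # hit the cache again; this only clears it between runs.
        clear_cache()

    def time_prem(self):
        prem((x + y + 1)**3, (x - y)**2, x)
```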

@1e9abhi1e10 (Contributor, Author) commented Jul 17, 2023:

> I think this looks good, but we should reduce the total number of benchmarks that are reported for both PREM and subresultants.

subresultants internally uses prem, so I don't think that we should keep the prem benchmarks.
It would be good for us to keep the benchmarks for subresultants and gcd.

@1e9abhi1e10 (Contributor, Author) commented Jul 17, 2023:

> As an aside the timings here comparing expr, dense and sparse demonstrate a common problem with the sympy benchmark suite. The expr timings suggest that it is faster than both dense and sparse in most cases (up to 20x faster for larger problems). This is almost certainly not true though: under the hood the expr version of e.g. prem uses either the dense or sparse implementation so it is necessarily slower.
>
> The reason for this is that many SymPy operations (with expr) use a cache and the benchmarks report timings from repeating an operation many times while the cache is in operation. It means that the timings reported for expr are often only really representative of the performance of the cache rather than the actual time that it takes to compute something that is not already cached.

Should we remove expr, or is there a method so that we can calculate the actual timing of expr?

@oscarbenjamin (Contributor):

> Should we remove expr, or is there a method so that we can calculate the actual timing of expr?

No, we can leave it there. We just need to recognise that those benchmarks are not always representative of any actual improvement or regression in performance.

@oscarbenjamin (Contributor):

> subresultants internally uses prem, so I don't think that we should keep the prem benchmarks.
> It would be good for us to keep the benchmarks for subresultants and gcd.

I would like to keep something for PREM because, for example, it might not be used by subresultants in future. I just don't think we need so many benchmarks for PREM. Also, if the different cases are not particularly interesting for revealing different aspects of the performance of subresultants, then we don't need so many cases for that either.

@1e9abhi1e10 (Contributor, Author):

> > subresultants internally uses prem, so I don't think that we should keep the prem benchmarks.
> > It would be good for us to keep the benchmarks for subresultants and gcd.
>
> I would like to keep something for PREM because, for example, it might not be used by subresultants in future. I just don't think we need so many benchmarks for PREM. Also, if the different cases are not particularly interesting for revealing different aspects of the performance of subresultants, then we don't need so many cases for that either.

Should we keep TimePREM_LinearDenseQuadraticGCD and TimePREM_QuadraticNonMonicGCD for PREM, and TimeSUBRESULTANTS_LinearDenseQuadraticGCD, TimeSUBRESULTANTS_QuadraticNonMonicGCD and TimeSUBRESULTANTS_SparseNonMonicQuadratic for subresultants?

@oscarbenjamin (Contributor):

> Should we keep TimePREM_LinearDenseQuadraticGCD and TimePREM_QuadraticNonMonicGCD for PREM, and TimeSUBRESULTANTS_LinearDenseQuadraticGCD, TimeSUBRESULTANTS_QuadraticNonMonicGCD and TimeSUBRESULTANTS_SparseNonMonicQuadratic for subresultants?

Why in particular those benchmarks?

Or is that just a random choice?

Do TimePREM_LinearDenseQuadraticGCD and TimePREM_QuadraticNonMonicGCD demonstrate different aspects of the performance of prem somehow?

Are the other cases less relevant than those two?

@1e9abhi1e10 (Contributor, Author):

For TimePREM_SparseGCDHighDegree the polynomials have a sparse GCD of high degree. When the degree is high, sparsity might make the pseudo-remainder method less efficient compared to other approaches such as modular methods.

@1e9abhi1e10 (Contributor, Author):

We could also reduce the values of size (n) in params = [(1, 3, 5, 8), ('expr', 'dense', 'sparse')]; maybe two values would be enough.

@oscarbenjamin (Contributor):

> For TimePREM_SparseGCDHighDegree the polynomials have a sparse GCD of high degree. When the degree is high, sparsity might make the pseudo-remainder method less efficient compared to other approaches such as modular methods.

Okay, well that seems like a good reason. Maybe some docstrings should be added to the benchmark cases explaining why they were chosen. It would also be good to add docstrings to the example types like _LinearDenseQuadraticGCD so that, when looking at the code, we have some idea what the polynomials they generate will look like.

I put some docstrings in the suggested code at #89 (comment) as an illustration of how that might look. Perhaps it would be good to pick a particular n and show what f, g and d would be for that case, explaining how n relates to the generated polynomials.
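
As a sketch of what such a docstring might look like (the polynomials below are made up for illustration; they are not the actual _LinearDenseQuadraticGCD definitions):

```python
class _ExampleGCDCaseSketch:
    """Hypothetical docstring illustrating the suggested style.

    For n = 2 the generated polynomials could look like (illustrative only):

        d = (x + y1 + y2 + 1)**2     # the GCD: dense, linear in each symbol
        f = d * (x - y1 - y2 - 2)
        g = d * (x + y1 + y2 + 2)

    so n controls both the number of symbols y1..yn and the size of the
    common factor d.
    """
```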

@1e9abhi1e10 (Contributor, Author):

@oscarbenjamin Please have a look!

@oscarbenjamin (Contributor):

Okay, looks good.

@oscarbenjamin oscarbenjamin merged commit 8b75a9a into sympy:master Jul 20, 2023
@1e9abhi1e10 (Contributor, Author):

Thanks!
