Time full program runtime #34

ElliottKasoar · 2023-11-17T15:12:45Z

Changes to timing method to resolve #33

- The most significant change is to use /usr/bin/time to time the full runtime of each benchmark program (with date as an extra check). This gives a more reliable value than timing within each loop, or even within the program but outside the loops, because of asynchronous functions etc.
  - Note: this approach may need further changes for testing on multiple nodes, but this is not currently an issue.
- Removes the module deletion/creation timer, as this is not something that will realistically be called multiple times. Arguably, a more meaningful idea of its effect can be obtained by varying N (e.g. running for N=1000 and N=10,000, with the module timings becoming increasingly insignificant as N grows).
- Removes (and does not update) the large stride benchmarks from the benchmarking script.
- Adds a hard-coded (for simplicity) way to use the cuda flag in forpy via a TorchScript model, allowing GPU comparisons.
  - The flag will not work without USETS=1 at compile time, but I'm not sure it's worth investing more time into forpy.
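For readers who want to see the idea of timing the full program rather than individual loops, here is a minimal Python sketch. It is not the benchmarking script from this PR (which uses /usr/bin/time from a shell script); the function name `time_full_run` and the placeholder command are assumptions made for illustration only.

```python
import subprocess
import sys
import time


def time_full_run(cmd):
    """Return (wall_seconds, return_code) for one complete run of cmd.

    The clock starts before the process is launched and stops after it
    exits, so start-up, any asynchronous work, and teardown are all
    included in the measurement.
    """
    start = time.perf_counter()
    completed = subprocess.run(cmd, check=False)
    elapsed = time.perf_counter() - start
    return elapsed, completed.returncode


if __name__ == "__main__":
    # Placeholder command: a trivial Python process stands in for a
    # benchmark executable (hypothetical, not a program from this repo).
    wall, code = time_full_run([sys.executable, "-c", "pass"])
    print(f"wall time: {wall:.3f} s (exit code {code})")
```

Because the measurement wraps the whole process, fixed costs such as module creation are included; running at two values of N, as suggested above, shows how much of the total they account for.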

To do:

- Tidy up Python, e.g. device comments/setting ("cpu" isn't set as the default as described, and the docstring is unclear)
- Update Python utils to read slurm output file with wall times
- Update notebook with new plots/results (resolve Track benchmarking #16?)
- Update to reflect changes for the fypp preprocessor
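As a rough illustration of the "read slurm output file with wall times" item, the sketch below pulls wall times out of a log. It assumes the timings were written in the POSIX `time -p` format (`real`, `user` and `sys` lines, in seconds); the actual format used by the benchmarking script may differ, and `read_wall_times` and the sample log are hypothetical names and data, not the repository's utilities or results.

```python
import re

# Matches lines such as "real 12.34" as produced by `time -p`.
# Assumption: one "real" line per benchmark run in the log.
REAL_LINE = re.compile(r"^real[ \t]+([0-9]+(?:\.[0-9]+)?)[ \t]*$", re.MULTILINE)


def read_wall_times(log_text):
    """Return all wall-clock times (in seconds) found in log_text, in order."""
    return [float(value) for value in REAL_LINE.findall(log_text)]


if __name__ == "__main__":
    # Made-up log contents for illustration only; not real benchmark results.
    sample_log = (
        "benchmark A\n"
        "real 12.34\n"
        "user 11.90\n"
        "sys 0.40\n"
        "benchmark B\n"
        "real 5.60\n"
        "user 5.10\n"
        "sys 0.30\n"
    )
    print(read_wall_times(sample_log))
```

Only the `real` lines are kept, since `user` and `sys` are CPU times rather than elapsed wall time.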

TomMelt · 2023-12-15T15:22:54Z

Thanks @ElliottKasoar. I will try running this on my machine first, before I review it 👍

TomMelt · 2024-03-18T09:42:29Z

Code looks good. I am working with @surbhigoel77 to reproduce the tests on CSD3 before we merge

jatkinson1000 · 2024-04-29T10:26:22Z

@surbhigoel77 @TomMelt Is there any update on this?
We are just working through the project board recapping open issues.
Thanks.

jatkinson1000 · 2024-05-13T10:03:03Z

> @surbhigoel77 @TomMelt Is there any update on this? We are just working through the project board recapping open issues. Thanks.

@surbhigoel77 @TomMelt
Please could you give us an update here?
This is still listed as 'in progress' on the projects board. Does it need reverting or closing?

ElliottKasoar added 17 commits November 15, 2023 17:15

Remove module timer and enable GPU TorchScript

4cf06b3

Remove LS and use time binary in benchmarking script

1506cdf

Fix setting device for GPUs

38bc3bc

Update GPU benchmarking script to time full runtime

b98719c

Update benchmark read utils to read walltimes

11ccbd9

Remove unused module timing code

bc49c1f

Remove C types from resnet benchmark

ea9dc31

Remove C types from cgdrag explicit benchmark

ffcfbb6

Update torch_tensor_from_blob parameter order

2b9d780

Remove allocation in loop from default benchmarks

d1915f7

Update device docstrings

9bd1bee

Extend bar chart plotting options

49f582d

Fix tensor deletion

42bd2ca

Fix model deletion and comments

9b28858

Fix memory bug in MiMA benchmark

864d229

Add option to normalise copy of benchmark

0c1f4b6

Update notebook with optimisation comparisons

3001299

ElliottKasoar marked this pull request as ready for review December 15, 2023 14:56

ElliottKasoar requested review from TomMelt and jatkinson1000 December 15, 2023 14:56

Tidy code

dbc0e80
