Make mp4 parsing a lot faster & tremendously lower memory overhead #7860

jprochazk · 2024-10-22T12:21:10Z

What

Some rough numbers:

main@de4c389   297.43   µs
this branch      3.8957 µs

Measured on Big_Buck_Bunny_1080_10s_av1.mp4 from our test assets (downloaded via tests/assets/download_test_assets.py)

Total Rerun memory usage for a 297 MiB video (the entire Sintel movie):

main@de4c389   573 MiB
this branch    325 MiB

Checklist

I have read and agree to Contributor Guide and the Code of Conduct
I've included a screenshot or gif (if applicable)
I have tested the web demo (if applicable):
- Using examples from latest main build: rerun.io/viewer
- Using full set of examples from nightly build: rerun.io/viewer
The PR title and labels are set such as to maximize their usefulness for the next release's CHANGELOG
If applicable, add a new check to the release checklist!
If have noted any breaking changes to the log API in CHANGELOG.md and the migration guide

To run all checks from main, comment on the PR with @rerun-bot full-check.

jprochazk · 2024-10-22T12:27:38Z

We still copy each sample when we enqueue a chunk, which feels like it should not be necessary. It would be nice to have direct access to the video data arrow buffer in decoders, so that we can index into it there, but I wasn't 100% sure how to make that work here so I left it out.

Wumpf

similar to the commentary on the parent PR I'm super happy about this being so low-hanging, but also like there the distinction of "what data is this" needs more love. In partiulcar VideoData is already a type and now we're passing a lot of video_data which is not related to it.
Ideally we'd use some reference counted object from the guts of Blob, but I haven't looked into whether that's viable (we really don't want to end up passing around raw arrow buffers, infecting everything with that dependency)

If we advertise this in the changelog, we should have before after perf & memory numbers. Just give it a quick spin with one of the larger videos on our internal video repo, thank you! :)

crates/store/re_video/examples/frames.rs

crates/store/re_video/src/demux/mod.rs

crates/viewer/re_data_ui/src/blob.rs

jprochazk · 2024-10-22T13:26:25Z

(trying to measure this by adding a benchmark)

jprochazk · 2024-10-22T13:34:45Z

Some rough numbers:

main@de4c389   297.43   µs
this branch      3.8957 µs

Measured on Big_Buck_Bunny_1080_10s_av1.mp4 from our test assets (downloaded via tests/assets/download_test_assets.py)

Total Rerun memory usage for a 297 MiB video (the entire Sintel movie):

main@de4c389   573 MiB
this branch    325 MiB

crates/store/re_video/benches/video_load_bench.rs

jprochazk · 2024-10-22T13:52:34Z

~~Running a nightly build to check if the bench works: https://github.com/rerun-io/rerun/actions/runs/11461677788~~ (it did not)

jprochazk · 2024-10-22T14:00:47Z

~~Bench was missing nasm for the build, maybe it works now: https://github.com/rerun-io/rerun/actions/runs/11461912756/job/31892063198 - will cancel it if the benchmark succeeds~~ - it downloaded the test assets and successfully compiled, good enough

Wumpf

awesome!

crates/store/re_video/benches/video_load_bench.rs

Wumpf · 2024-10-22T16:17:43Z

crates/store/re_video/benches/video_load_bench.rs

-        .unwrap()
-        .parent()
+        .ancestors()
+        .nth(3)


@jprochazk

### What This is now feasible thanks to @jprochazk's * #7860 ![image](https://github.com/user-attachments/assets/0cd063d9-7e44-4892-a751-022784e12d5d) ### Checklist * [x] I have read and agree to [Contributor Guide](https://github.com/rerun-io/rerun/blob/main/CONTRIBUTING.md) and the [Code of Conduct](https://github.com/rerun-io/rerun/blob/main/CODE_OF_CONDUCT.md) * [x] I've included a screenshot or gif (if applicable) * [x] I have tested the web demo (if applicable): * Using examples from latest `main` build: [rerun.io/viewer](https://rerun.io/viewer/pr/7869?manifest_url=https://app.rerun.io/version/main/examples_manifest.json) * Using full set of examples from `nightly` build: [rerun.io/viewer](https://rerun.io/viewer/pr/7869?manifest_url=https://app.rerun.io/version/nightly/examples_manifest.json) * [x] The PR title and labels are set such as to maximize their usefulness for the next release's CHANGELOG * [x] If applicable, add a new check to the [release checklist](https://github.com/rerun-io/rerun/blob/main/tests/python/release_checklist)! * [x] If have noted any breaking changes to the log API in `CHANGELOG.md` and the migration guide - [PR Build Summary](https://build.rerun.io/pr/7869) - [Recent benchmark results](https://build.rerun.io/graphs/crates.html) - [Wasm size tracking](https://build.rerun.io/graphs/sizes.html) To run all checks from `main`, comment on the PR with `@rerun-bot full-check`.

don't copy mp4 data

e24d9a1

jprochazk added 🚀 performance Optimization, memory use, etc include in changelog 🎞️ video labels Oct 22, 2024

fix frames example

9b7c972

Wumpf self-requested a review October 22, 2024 12:30

jprochazk mentioned this pull request Oct 22, 2024

Optimize mp4 parse times, and avoid duplicating their RAM use #7481

Closed

Wumpf changed the title ~~Optimize mp4 parsing~~ Make mp4 parsing faster & reduce memory overhead Oct 22, 2024

fix lint + add comment

ef01c8e

Wumpf requested changes Oct 22, 2024

View reviewed changes

crates/store/re_video/examples/frames.rs Outdated Show resolved Hide resolved

crates/store/re_video/src/demux/mod.rs Show resolved Hide resolved

crates/viewer/re_data_ui/src/blob.rs Outdated Show resolved Hide resolved

video_data -> video_blob

ffdd963

jprochazk added 2 commits October 22, 2024 15:31

video_data -> video_blob

da12cd3

add benchmark

f735952

Wumpf reviewed Oct 22, 2024

View reviewed changes

crates/store/re_video/benches/video_load_bench.rs Outdated Show resolved Hide resolved

jprochazk and others added 3 commits October 22, 2024 15:45

make it a thing

800ad13

Merge branch 'main' into jan/zerocopy-mp4-read

d9d235f

update rev

7e3bdf1

jprochazk added 2 commits October 22, 2024 15:56

fix lint

c02b1ef

run benches under pixi

6fedb6f

wrong lint name

562a7ab

jprochazk requested a review from Wumpf October 22, 2024 14:08

Wumpf approved these changes Oct 22, 2024

View reviewed changes

crates/store/re_video/benches/video_load_bench.rs Outdated Show resolved Hide resolved

Wumpf changed the title ~~Make mp4 parsing faster & reduce memory overhead~~ Make mp4 parsing a lot faster & reduce memory overhead Oct 22, 2024

Wumpf changed the title ~~Make mp4 parsing a lot faster & reduce memory overhead~~ Make mp4 parsing **a lot** faster & tremendously lower memory overhead Oct 22, 2024

less bad

ad4afc1

jprochazk merged commit 613a35b into main Oct 22, 2024
27 of 28 checks passed

jprochazk deleted the jan/zerocopy-mp4-read branch October 22, 2024 16:13

Wumpf reviewed Oct 22, 2024

View reviewed changes

crates/store/re_video/benches/video_load_bench.rs

.unwrap()

.parent()

.ancestors()

.nth(3)

Copy link

Member

Wumpf Oct 22, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤯

emilk mentioned this pull request Oct 23, 2024

Fix view creation heuristics for videos #7869

Merged

6 tasks

Wumpf mentioned this pull request Oct 23, 2024

CI: downloads test assets before executing tests #7874

Merged

6 tasks

jprochazk mentioned this pull request Oct 23, 2024

Don't copy video chunk data #7878

Open

talmo mentioned this pull request Nov 15, 2024

inaccurate seeking janclemenslab/napari-video#3

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make mp4 parsing a lot faster & tremendously lower memory overhead #7860

Make mp4 parsing a lot faster & tremendously lower memory overhead #7860

jprochazk commented Oct 22, 2024 •

edited by github-actions bot

Loading

jprochazk commented Oct 22, 2024

Wumpf left a comment •

edited

Loading

jprochazk commented Oct 22, 2024

jprochazk commented Oct 22, 2024 •

edited

Loading

jprochazk commented Oct 22, 2024 •

edited

Loading

jprochazk commented Oct 22, 2024 •

edited

Loading

Wumpf left a comment

Wumpf Oct 22, 2024

Make mp4 parsing **a lot** faster & tremendously lower memory overhead #7860

Make mp4 parsing **a lot** faster & tremendously lower memory overhead #7860

Conversation

jprochazk commented Oct 22, 2024 • edited by github-actions bot Loading

What

Checklist

jprochazk commented Oct 22, 2024

Wumpf left a comment • edited Loading

Choose a reason for hiding this comment

jprochazk commented Oct 22, 2024

jprochazk commented Oct 22, 2024 • edited Loading

jprochazk commented Oct 22, 2024 • edited Loading

jprochazk commented Oct 22, 2024 • edited Loading

Wumpf left a comment

Choose a reason for hiding this comment

Wumpf Oct 22, 2024

Choose a reason for hiding this comment

Make mp4 parsing a lot faster & tremendously lower memory overhead #7860

Make mp4 parsing a lot faster & tremendously lower memory overhead #7860

jprochazk commented Oct 22, 2024 •

edited by github-actions bot

Loading

Wumpf left a comment •

edited

Loading

jprochazk commented Oct 22, 2024 •

edited

Loading

jprochazk commented Oct 22, 2024 •

edited

Loading

jprochazk commented Oct 22, 2024 •

edited

Loading