Fix recent FuzzBench cloud experiment failures #2023

DonggeLiu · 2024-08-12T12:49:05Z

Fix TypeError: expected str, bytes or os.PathLike object, not NoneType in 2024-08-10-test.

Traceback (most recent call last):
  File "/src/experiment/runner.py", line 468, in experiment_main
    runner.conduct_trial()
  File "/src/experiment/runner.py", line 290, in conduct_trial
    self.set_up_corpus_directories()
  File "/src/experiment/runner.py", line 275, in set_up_corpus_directories
    _unpack_clusterfuzz_seed_corpus(target_binary, input_corpus)
  File "/src/experiment/runner.py", line 144, in _unpack_clusterfuzz_seed_corpus
    seed_corpus_archive_path = get_clusterfuzz_seed_corpus_path(
  File "/src/experiment/runner.py", line 98, in get_clusterfuzz_seed_corpus_path
    fuzz_target_without_extension = os.path.splitext(fuzz_target_path)[0]
  File "/usr/local/lib/python3.10/posixpath.py", line 118, in splitext
    p = os.fspath(p)
TypeError: expected str, bytes or os.PathLike object, not NoneType

This happens on many benchmarks+fuzzers.
To be investigated later:

Why fuzz_target_path is None.
Why this did not happen in other recent experiments.
I thought I had seen this long ago. Déjà vu?
Fix No such file or directory: '/work/measurement-folders/<benchmark>-<fuzzer>/merged.json':

Traceback (most recent call last):
  File "/work/src/experiment/measurer/coverage_utils.py", line 74, in generate_coverage_report
    coverage_reporter.generate_coverage_summary_json()
  File "/work/src/experiment/measurer/coverage_utils.py", line 141, in generate_coverage_summary_json
    result = generate_json_summary(coverage_binary,
  File "/work/src/experiment/measurer/coverage_utils.py", line 269, in generate_json_summary
    with open(output_file, 'w', encoding='utf-8') as dst_file:
FileNotFoundError: [Errno 2] No such file or directory: '/work/measurement-folders/lcms_cms_transform_fuzzer-centipede/merged.json'

Remove incompatible benchmarks: openh264_decoder_fuzzer, stb_stbi_read_fuzzer
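
A minimal sketch of the idea behind the second item, assuming the `FileNotFoundError` comes from a missing parent directory rather than a missing file (the commit message "Ensure the file's parent dir always exists before writing to it" suggests this). The helper name `write_json_summary` and the payload are hypothetical and are not taken from `coverage_utils.py`:

```python
import json
import os
import tempfile


def write_json_summary(output_file, summary):
    """Writes `summary` as JSON to `output_file`, creating parent dirs first.

    Hypothetical helper for illustration only.
    """
    parent_dir = os.path.dirname(output_file)
    if parent_dir:
        # exist_ok=True keeps this safe when the directory already exists.
        os.makedirs(parent_dir, exist_ok=True)
    with open(output_file, 'w', encoding='utf-8') as dst_file:
        json.dump(summary, dst_file)


# Usage: the nested measurement folder does not exist before the call.
with tempfile.TemporaryDirectory() as work_dir:
    merged_json = os.path.join(work_dir, 'measurement-folders',
                               'benchmark-fuzzer', 'merged.json')
    write_json_summary(merged_json, {'placeholder': True})
    assert os.path.isfile(merged_json)
```

Without the `os.makedirs` call, `open(..., 'w')` raises `FileNotFoundError` whenever the measurement folder has not been created yet, which matches the traceback above.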

DonggeLiu · 2024-08-12T12:57:23Z

/gcbrun run_experiment.py -a --experiment-config /opt/fuzzbench/service/experiment-config.yaml --experiment-name 2024-08-12-dg --fuzzers aflplusplus centipede honggfuzz libfuzzer --benchmarks stb_stbi_read_fuzzer openh264_decoder_fuzzer

DonggeLiu · 2024-08-12T13:56:37Z

/gcbrun run_experiment.py -a --experiment-config /opt/fuzzbench/service/experiment-config.yaml --experiment-name 2024-08-12-2023 --fuzzers aflplusplus centipede honggfuzz libfuzzer --benchmarks stb_stbi_read_fuzzer openh264_decoder_fuzzer

DonggeLiu · 2024-08-12T13:59:51Z

Experiment 2024-08-12-2023 data and results will be available later at:
The experiment data.
The experiment report.
The experiment report(experimental).

DonggeLiu · 2024-08-13T00:32:55Z

This likely failed because both fuzz targets failed to generate coverage reports, e.g.:

Not sure if this is related: OSS-Fuzz's build status page shows openh264_decoder_fuzzer failed.

DonggeLiu · 2024-08-13T00:40:49Z

/gcbrun run_experiment.py -a --experiment-config /opt/fuzzbench/service/experiment-config.yaml --experiment-name 2024-08-13-2023-libfuzzer-1 --fuzzers libfuzzer

DonggeLiu · 2024-08-13T00:56:08Z

Experiment 2024-08-13-2023-libfuzzer-1 data and results will be available later at:
The experiment data.
The experiment report.
The experiment report(experimental).

DonggeLiu · 2024-08-13T05:39:25Z

Report is back : ) @addisoncrump
I will wait a bit longer before merging this to ensure the report stays alive.
Once I merge this to master, could you please update your PR and bring back the changes you added?
Thanks!

addisoncrump · 2024-08-13T10:03:09Z

Sure, I'll rebase.

addisoncrump · 2024-08-13T12:50:28Z

@DonggeLiu I am able to build both openh264 and stb_stbi fuzzers as in master locally with no issue. Like #2021, I think this is a cache issue.

DonggeLiu · 2024-08-13T23:51:34Z

@DonggeLiu I am able to build both openh264 and stb_stbi fuzzers as in master locally with no issue. Like #2021, I think this is a cache issue.

I see, thanks for the info!
Given that you are investigating this, is there any help I can provide?
For example, if you think some more cloud build logs can save you time debugging, please feel free to add them and request experiments.
I can run them for you and send you the related logs.

DonggeLiu · 2024-08-13T23:57:17Z

The report on this PR is still not ready, likely because some VMs were preempted.
I will give it one more day just to be 100% safe.

addisoncrump · 2024-08-14T11:07:15Z

Given that you are investigating this, is there any help I can provide?

Ah, I was investigating the specific issue with the bug benchmark. I don't think I can offer much help with the CI or the fuzzbench infra directly. I can say, however, that the coverage benchmarks you removed do work as expected locally with test-run. I need to check if the coverage measurer works as anticipated; maybe this needs to be updated instead.

addisoncrump · 2024-08-14T11:30:40Z

Ah, @DonggeLiu, try running make test-run-coverage-all. It complains that it can't find bloaty_fuzz_target on master 👀

tokatoka · 2024-08-14T11:50:11Z

@DonggeLiu I am able to build both openh264 and stb_stbi fuzzers as in master locally with no issue. Like #2021, I think this is a cache issue.

Same for me: they are working. I don't think they should be removed.

This reverts commit 50bdf34.

DonggeLiu · 2024-08-14T11:53:20Z

I see, thanks @addisoncrump and @tokatoka .
I've brought them back.

The experiment is about to finish; I will merge this tomorrow morning.

addisoncrump · 2024-08-14T12:56:59Z

I confirmed the coverage measurers build locally as well. Will test when everything has finished building.

addisoncrump · 2024-08-14T16:32:05Z

Yup, I tested openh264 and stb benchmarks locally and they do perform measurements as anticipated. The issue is with the GCP runs, I would presume a build cache issue.

DonggeLiu · 2024-08-15T00:23:02Z

Yup, I tested openh264 and stb benchmarks locally and they do perform measurements as anticipated. The issue is with the GCP runs, I would presume a build cache issue.

I see. I reckon this could be due to an incompatibility between the GCP VM environment and LLVM?
I will look into this once I finish the other tasks at hand.

Just to double-check @addisoncrump :
When you test them locally, did you remove their old local images beforehand?

DonggeLiu · 2024-08-15T00:23:34Z

Thanks for the information again, @addisoncrump!

This reverts commit 4eb4f3b.

DonggeLiu · 2024-08-15T00:27:39Z

TBR by @jonathanmetzman.

The experiment proving that this works:
#2023 (comment)

addisoncrump · 2024-08-15T21:54:33Z

When you test them locally, did you remove their old local images beforehand?

Yes, I do a docker system prune --all before every experiment.

DonggeLiu · 2024-08-15T22:00:56Z

When you test them locally, did you remove their old local images beforehand?

Yes, I do a docker system prune --all before every experiment.

I see, thanks for confirming.
I will merge this then.

@addisoncrump

Temporarily disable benchmarks `stb_stbi_read_fuzzer` and `openh264_decoder_fuzzer` from cloud experiments, because they are [proven](#2023 (comment)) to be incompatible with the cloud build/run environment. @addisoncrump kindly confirmed that they [work in local experiments](#2023 (comment)).

tokatoka · 2024-08-27T12:31:38Z

I thought I had seen this long ago. Déjà vu?

The same bug happened 1 year ago
#1886

DonggeLiu · 2024-08-28T00:33:50Z

The same bug happened 1 year ago #1886

Thanks for noticing this, let me see if @jonathanmetzman has more insight once he is back.

DonggeLiu · 2024-08-28T00:42:32Z

experiment/runner.py

@@ -95,6 +95,8 @@ def _clean_seed_corpus(seed_corpus_dir):
 def get_clusterfuzz_seed_corpus_path(fuzz_target_path):
    """Returns the path of the clusterfuzz seed corpus archive if one exists.
    Otherwise returns None."""
+    if not fuzz_target_path:
+        return None


Add an error log here because this is unexpected.
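
A minimal sketch of what the guard plus the requested error log might look like. The function name `get_seed_corpus_path_or_none` and the `_seed_corpus.zip` suffix are assumptions made for illustration; this is not the code in `experiment/runner.py`:

```python
import logging
import os

logger = logging.getLogger(__name__)

# Assumed archive suffix, for illustration only.
SEED_CORPUS_ARCHIVE_SUFFIX = '_seed_corpus.zip'


def get_seed_corpus_path_or_none(fuzz_target_path):
    """Returns the seed corpus archive path if one exists, otherwise None.

    Logs an error when no fuzz target path was provided, since that is
    unexpected and would otherwise make os.path.splitext raise TypeError.
    """
    if not fuzz_target_path:
        logger.error('fuzz_target_path is empty or None; '
                     'skipping seed corpus lookup.')
        return None
    fuzz_target_without_extension = os.path.splitext(fuzz_target_path)[0]
    seed_corpus_path = (fuzz_target_without_extension +
                        SEED_CORPUS_ARCHIVE_SUFFIX)
    return seed_corpus_path if os.path.exists(seed_corpus_path) else None
```

Logging at error level keeps the trial running while still making the missing target binary visible in the experiment logs.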

One question, why is this function even called?

https://github.com/google/fuzzbench/blob/master/experiment/runner.py#L277
I think this is the line that eventually calls this function. But, for example, when we observed the error in addison's experiment, the OSS-Fuzz corpus was NOT used, right (unless they specified oss-fuzz-corpus: true)?
Then why would we unpack the ClusterFuzz seed corpus at all?

Aren't the seed corpora already prepared in build.sh or the Dockerfile of each benchmark?

Do you know whether the env var CUSTOM_SEED_CORPUS_DIR is set in a normal(?) run?

To me, these two lines seem wrong:

    elif not environment.get('CUSTOM_SEED_CORPUS_DIR'):
        _unpack_clusterfuzz_seed_corpus(target_binary, input_corpus)

Even if we don't use CUSTOM_SEED_CORPUS_DIR, we don't necessarily need the ClusterFuzz seed corpus, do we?

Why target_binary is None here is a separate problem that also needs investigation.

addisoncrump · 2024-08-28T23:53:20Z

Just to reiterate, this is a major threat to validity, especially when cached data is used. The cache completely overwrites the report, so the final report shows only the last successful experiment. This effectively invalidates all future FuzzBench reports until this issue is resolved.

I think the report generation issue indicates that safeguards should be put in place that simply terminate the experiment in such degenerate cases, since the results are effectively guaranteed to be invalid.
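
One way such a safeguard could look, as a rough sketch only. Everything here is hypothetical: the `DegenerateExperimentError` class, the `check_experiment_health` function, the input shape, and the 50% threshold are illustrative choices and not part of FuzzBench:

```python
class DegenerateExperimentError(RuntimeError):
    """Raised when too many trials failed for the results to be meaningful."""


def check_experiment_health(trial_succeeded, max_failure_ratio=0.5):
    """Aborts when the share of failed (benchmark, fuzzer) pairs is too high.

    Args:
        trial_succeeded: dict mapping (benchmark, fuzzer) tuples to a bool
            saying whether that pair produced a usable measurement.
        max_failure_ratio: hypothetical threshold above which to abort.

    Returns:
        A sorted list of the failed pairs when the experiment may continue.
    """
    if not trial_succeeded:
        raise DegenerateExperimentError('No trials were recorded.')
    failed = sorted(pair for pair, ok in trial_succeeded.items() if not ok)
    failure_ratio = len(failed) / len(trial_succeeded)
    if failure_ratio > max_failure_ratio:
        raise DegenerateExperimentError(
            f'{len(failed)} of {len(trial_succeeded)} benchmark-fuzzer '
            'pairs failed; refusing to generate a report.')
    return failed
```

Raising instead of silently continuing means a stale cached report is never presented as the result of a failed experiment.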

Fix bug when fuzz_target_path is None

03a5e32

DonggeLiu requested a review from jonathanmetzman August 12, 2024 12:49

DonggeLiu mentioned this pull request Aug 12, 2024

Archive coverage data alongside corpus archives #2020

Closed

A dummy comment to enable PR exp

4eb4f3b

Ensure the file's parent dir always exists before writing to it

cd18345

DonggeLiu force-pushed the fix-non-path branch from 5247b68 to cd18345 Compare August 12, 2024 13:53

These two benchmarks failed on coverage report generation.

50bdf34

DonggeLiu changed the title ~~Fix bug when fuzz_target_path is None~~ Fix recent FuzzBench cloud experiment failures Aug 13, 2024

tokatoka mentioned this pull request Aug 13, 2024

Seed experiment #2025

Open

Revert "These two benchmarks failed on coverage report generation."

e4a52c8

This reverts commit 50bdf34.

addisoncrump mentioned this pull request Aug 14, 2024

Update libafl-based fuzzers (from AFL++ fork) #2027

Merged

Revert "A dummy comment to enable PR exp"

dcabe6e

This reverts commit 4eb4f3b.

DonggeLiu merged commit b2f87ff into master Aug 15, 2024
5 checks passed

DonggeLiu deleted the fix-non-path branch August 15, 2024 22:01

DonggeLiu mentioned this pull request Aug 16, 2024

Disable incompatible benchmarks for cloud experiments #2030

Merged
