"Note" non-fatal INTAKE behaviors #3510

Merged: 2 commits merged into distributed-system-analysis:main from the aonly branch on Aug 3, 2023

Conversation

dbutenhof (Member)

PBENCH-1231

After accidentally observing that an uploaded dataset marked "archiveonly" was not indexed (because the file name didn't match the result directory name), we decided that the intake process should make the reasons more obvious.

I've handled this by adding "notes", which are recorded both in the successful JSON response payload and in the finalized audit record. While I was at it, I exposed our decisions regarding the canonical benchmarked workload name and the computed expiration date.
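
To make this concrete, here's a rough sketch of the shape the response might take; the field names and note text below are illustrative placeholders, not the server's actual schema:

```python
# Illustrative only: roughly the kind of payload the intake response might
# carry once "notes" are included. Field names and note text are placeholders.
example_response = {
    "message": "File successfully uploaded",
    "notes": [
        "Results archive is marked 'archive-only': it will not be indexed.",
        "Identified benchmark workload 'fio'.",
        "Expected expiration date is 2025-08-01.",
    ],
}
```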

While doing this, I stripped the rather prolific logging of INTAKE down to just two messages: a "pre" log identifying the file and user, and a "post" log providing the summary file system status (adding the size of the tarball).
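
For the record, a minimal sketch of the two-message pattern I have in mind (the names, message formats, and paths here are illustrative, not the actual intake_base.py code):

```python
import logging
import shutil

logger = logging.getLogger("pbench.server.intake")


def intake_sketch(username: str, tarball: str, size: int) -> None:
    # "pre": identify the file and the uploading user before any work starts
    logger.info("INTAKE (pre) %s for user %s", tarball, username)

    # ... stream, validate, and settle the tarball into the ARCHIVE tree ...

    # "post": summary file system status, now including the tarball's size
    usage = shutil.disk_usage("/srv/pbench/archive")  # path is a placeholder
    logger.info(
        "INTAKE (post) %s: %d bytes added, %d bytes free in ARCHIVE",
        tarball,
        size,
        usage.free,
    )
```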

I debated logging the "notes", but didn't. On one hand, it would make the info more easily visible; on the other, it restores verbosity, and the notes strings are going to be unwieldy in the log. I've also considered several times that we might log every audit record, which would include the attributes... but that might be hard to read in the linear log format.

In any case, it's easy enough to do an Audit query on the dataset resource_id even when the client doesn't capture the "notes".
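
As a rough illustration of what I mean by "easy enough" (the endpoint and parameter names below are my shorthand for the audit API, not a verified description of it):

```python
import requests

SERVER = "https://pbench.example.com"  # placeholder server address
TOKEN = "<API token>"                  # placeholder bearer token
resource_id = "<dataset resource_id>"  # placeholder dataset ID

# Fetch the audit records for this dataset and print any recorded "notes".
response = requests.get(
    f"{SERVER}/api/v1/server/audit",
    headers={"Authorization": f"Bearer {TOKEN}"},
    params={"object_id": resource_id, "name": "upload"},
)
response.raise_for_status()
for record in response.json():
    notes = (record.get("attributes") or {}).get("notes")
    if notes:
        print(notes)
```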

@dbutenhof added the Server, API, Audit, and Operations labels on Aug 1, 2023
@dbutenhof requested review from ndokos and webbnh on Aug 1, 2023 at 20:31
@dbutenhof self-assigned this on Aug 1, 2023
@webbnh (Member) previously approved these changes on Aug 1, 2023

@webbnh left a comment:

> After accidentally observing an uploaded dataset that was marked "archiveonly" not indexed

I can't quite parse that...is it missing a word or two?

> I stripped the rather prolific logging of INTAKE to just two, a "pre" log identifying the file and user, and a "post" log providing the summary file system status (adding the size of the tarball).

This seems generally good, although for some reason I'm sad to lose the file system status from the "pre" message (I don't know why). However, I do have a concern that it won't be sufficiently easy to match the "post" message with its corresponding "pre" message, but perhaps that's just because I haven't seen it in action. Also, I'm wondering about the fact that the "post" message is positioned sort of in the middle of the request.

> I debated logging the "notes", but didn't. [...] it's easy enough to do an Audit query on the dataset resource_id even when the client doesn't capture the "notes".

In this instance, I'm not sure that less is more. I agree that we shouldn't be logging every audit record, and I support avoiding overloading the log. However, I don't think it is "easy" to do an audit query, relative to looking at the log in Opensearch. I wonder if we should be splitting the difference: I think there are some notes which are worth logging (like the fact that the Server decided on its own not to index the result) while others can rest in the audit (like projected expiration dates).

lib/pbench/server/api/resources/intake_base.py (review thread: outdated, resolved)
lib/pbench/test/functional/server/test_datasets.py (review thread: outdated, resolved)
@dbutenhof (Member, Author) left a comment:

> This seems generally good, although for some reason I'm sad to lose the file system status from the "pre" message (I don't know why). However, I do have a concern that it won't be sufficiently easy to match the "post" message with its corresponding "pre" message, but perhaps that's just because I haven't seen it in action. Also, I'm wondering about the fact that the "post" message is positioned sort of in the middle of the request.

The "post" message comes after everything but the final score keeping: we've settled all the files in their final places. But, yeah, it has sometimes bugged me that I didn't put it later: just not quite enough to move it. There are cases that'll back out after here, so in one sense the "final" disk report is redundant since we'll be freeing that space. At the time, though, I wanted to know that, and I didn't want yet another "mid" report. Logically, it might make more sense to move this to where I do the Sync and the final audit log; but I'm reluctant to either add another log here or to leave it unlogged. Hmm...

> I debated logging the "notes", but didn't. [...] it's easy enough to do an Audit query on the dataset resource_id even when the client doesn't capture the "notes".

> In this instance, I'm not sure that less is more. I agree that we shouldn't be logging every audit record, and I support avoiding overloading the log. However, I don't think it is "easy" to do an audit query, relative to looking at the log in Opensearch. I wonder if we should be splitting the difference: I think there are some notes which are worth logging (like the fact that the Server decided on its own not to index the result) while others can rest in the audit (like projected expiration dates).

I guess I'll split the difference and add a warning log on the no-metadata path to clearly mark it.
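
Something along these lines (an illustrative sketch only; the names here aren't the real intake_base.py identifiers):

```python
import logging

logger = logging.getLogger("pbench.server.intake")


def note_missing_metadata(name: str, metadata_log, notes: list[str]) -> None:
    """Sketch: record and log the decision when metadata.log is missing."""
    if metadata_log is None:
        note = (
            f"Results archive is missing '{name}/metadata.log'; "
            "it will be archived but not indexed."
        )
        notes.append(note)  # still reported to the client and the audit record
        logger.warning("INTAKE of %s: %s", name, note)  # now visible in the log
```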

lib/pbench/server/api/resources/intake_base.py (review thread: outdated, resolved)
lib/pbench/test/functional/server/test_datasets.py (review thread: outdated, resolved)
@webbnh (Member) left a comment:

👍

@dbutenhof merged commit c3688cd into distributed-system-analysis:main on Aug 3, 2023
@dbutenhof deleted the aonly branch on Aug 3, 2023 at 05:21