
flux-job: flux_kvs_lookup_get: Value too large for defined data type #6256

Closed
grondo opened this issue Sep 4, 2024 · 13 comments · Fixed by #6268

@grondo
Contributor

grondo commented Sep 4, 2024

A user is seeing this error from flux run in a subinstance with a job that may generate a lot of output. Not sure we've seen this one before.

job-info.err[0]: info_lookup_continuation: flux_kvs_lookup_get: Value too large for defined data type
job-info.err[0]: main_namespace_lookup_continuation: flux_rpc_get_unpack: Value too large for defined data type
flux-job: flux_job_event_watch_get: Value too large for defined data type
@chu11
Member

chu11 commented Sep 4, 2024

Was this a flux job info lookup on stdin/stdout? I'm wondering if stdin/stdout is pretty big and somewhere along the way some 2G boundary is crossed.

@grondo
Contributor Author

grondo commented Sep 4, 2024

This was likely flux job attach since it was in the output of flux run. According to the user, --output and --error were used, but I'm still trying to verify that.

@chu11
Member

chu11 commented Sep 5, 2024

So I'm guessing this user produced a bunch of stdout into the KVS, and once it was > 2G, EOVERFLOW occurred because the data returned would be larger than an int can represent. Reproduced via:

>flux submit t/shell/lptest 1000 5000000
ƒA6i9Muy

<wait some time while the job produces a bunch of stdout>

>flux jobs
       JOBID USER     NAME       ST NTASKS NNODES     TIME INFO
    ƒA6i9Muy achu     lptest      R      1      1   5.095m corona212

> flux job attach ƒA6i9Muy
Sep 05 15:36:14.999036 PDT job-info.err[0]: info_lookup_continuation: flux_kvs_lookup_get: Value too large for defined data type
Sep 05 15:36:14.999190 PDT job-info.err[0]: main_namespace_lookup_continuation: flux_rpc_get_unpack: Value too large for defined data type
flux-job: flux_job_event_watch_get: Value too large for defined data type

I'm not really sure how to fix / improve this without major re-engineering (i.e. reading KVS values with offsets / seeking, supporting > INT_MAX data). That may require RFC changes; I don't know off the top of my head where we define things as ints only.

For the time being ...

Better error message?
Return truncated output? (as an option?)

@chu11
Member

chu11 commented Sep 5, 2024

To my surprise we don't define things as int in the KVS treeobj format; we just have specific API functions that use int, such as treeobj_create_val(). I suppose we could technically support larger values by updating everything to use size_t (or unsigned int), but I don't know if we want to go down that path.
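
For reference, a minimal sketch of what that path could look like (hypothetical; the size_t-based variants below do not exist in flux-core):

    #include <stddef.h>
    #include <jansson.h>

    /* Existing helper, roughly: the length is an int, which caps values
     * near INT_MAX. */
    json_t *treeobj_create_val (const void *data, int len);

    /* Hypothetical size_t-based variants that would lift that cap. */
    json_t *treeobj_create_val_sized (const void *data, size_t len);
    int treeobj_decode_val_sized (const json_t *obj, void **data, size_t *len);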

@garlick
Member

garlick commented Sep 5, 2024

Well it is a bit silly to be failing for that reason, although fetching a > 2G message in a KVS get may fail for other reasons. It's certainly going to be a self-DoS for a while.

@chu11
Member

chu11 commented Sep 6, 2024

Just brainstorming here: we could support some type of flux_kvs_lookup_stream() that returns each blobref in the valref array one at a time?
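
A rough sketch of how a consumer might use such an API (hypothetical: flux_kvs_lookup_stream() and its _get accessor are invented here; only the streaming-future pattern with flux_future_reset() exists today):

    #include <errno.h>
    #include <stdio.h>
    #include <string.h>
    #include <flux/core.h>

    /* Hypothetical continuation: each response would carry one blob from the
     * valref's blobref array, so no single message exceeds INT_MAX. */
    static void stream_cb (flux_future_t *f, void *arg)
    {
        const void *buf;
        size_t len;

        if (flux_kvs_lookup_stream_get (f, &buf, &len) < 0) {   /* hypothetical */
            if (errno != ENODATA)
                fprintf (stderr, "lookup failed: %s\n", strerror (errno));
            flux_future_destroy (f);    /* ENODATA just means end of stream */
            return;
        }
        /* consume this chunk of data, then re-arm the future for the next blob */
        flux_future_reset (f);
    }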

@chu11
Member

chu11 commented Sep 6, 2024

A few brainstorms from this morning:

  • It just occurred to me that because this is going through kvs-watch, an "offset" is already being tracked, so hypothetically we could pass that into the KVS lookup easily.
  • For appended eventlogs, nothing stops us from returning less than 2G plus some flag that says "truncated", so that (combined with the above) kvs-watch knows to ask for more.
  • We are already using the special kvs.lookup-plus from within kvs-watch, so it's pretty easy to modify the protocol there.

@garlick
Member

garlick commented Sep 6, 2024

I wonder if it would be easier to implement a limit on storing a large value in the first place?

I hate to suggest modifying RFC 11 but we could maybe consider adding an optional size to val and valref?
(optional in the sense that it is allowed to be missing from older metadata)

If we have a running total size for a kvs value, then we have the means to reject an append that would cause it to exceed some maximum.
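
A sketch of the metadata shape that implies (hypothetical: the "size" key is the proposed RFC 11 addition; the ver/type/data keys follow the existing valref layout):

    #include <jansson.h>

    /* Build a valref treeobj carrying an optional running total "size",
     * which would let the KVS reject an append that pushes a value past
     * some configured maximum. */
    json_t *make_valref_with_size (json_t *blobrefs, json_int_t total_size)
    {
        return json_pack ("{s:i s:s s:O s:I}",
                          "ver", 1,
                          "type", "valref",
                          "data", blobrefs,        /* array of blobref strings */
                          "size", total_size);     /* proposed optional field */
    }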

@chu11
Member

chu11 commented Sep 6, 2024

I hate to suggest modifying RFC 11 but we could maybe consider adding an optional size to val and valref?
(optional in the sense that it is allowed to be missing from older metadata)
If we have a running total size for a kvs value, then we have the means to reject an append that would cause it to exceed some maximum.

That seems like a good idea for a full on correct solution in the KVS.

But if we're going down the path of just limiting appends, a quicker solution for this case might be to abuse the OUTPUT_LIMIT support in the shell to limit stdout/stderr. I see this in the shell's code:

    /*  Set default to unlimited (0) for single-user instances,
     *  O/w use the default multiuser output limit:
     */
    if (out->shell->broker_owner == getuid())
        out->kvs_limit_string = "0";
    else
        out->kvs_limit_string = MULTIUSER_OUTPUT_LIMIT;

if we just change that "0" to "2G" (or perhaps less if we want to be conservative), I think stdout/stderr will just be capped for single-user instances.
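
For illustration, the tweak would look something like this (the exact limit is a judgment call; the commits referenced later in this thread ended up using a 1G default):

    /*  Cap output stored in the KVS for single-user instances too,
     *  rather than leaving it unlimited:
     */
    if (out->shell->broker_owner == getuid())
        out->kvs_limit_string = "2G";    /* was "0" (unlimited) */
    else
        out->kvs_limit_string = MULTIUSER_OUTPUT_LIMIT;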

@garlick
Member

garlick commented Sep 6, 2024

Great point!

@chu11
Member

chu11 commented Sep 6, 2024

OK, let's go with that quick and dirty solution in the shell. The KVS change you mention above is the longer-term, generalized solution, because users might still abuse this if they write their own data to the KVS.

@garlick
Member

garlick commented Sep 6, 2024

OK, I'll open an issue on the long term one. It requires more thought/discussion/spelunking I think.

@grondo
Contributor Author

grondo commented Sep 6, 2024

We've had #5148 open since the original problem.

chu11 added a commit to chu11/flux-core that referenced this issue Sep 6, 2024
Problem: The KVS has a size limit of INT_MAX for when returning kvs
values.  This limit can be exceeded by a job's standard output because
it is continually appended and the total size is not yet tracked by
the KVS.  When reading the output later, such as via `flux job attach`,
this can lead to EOVERFLOW errors.

Solution: For a single user instance, default to a maximum standard
output of 1G instead of "unlimited".  1G should provide a practical
maximum for most users and encourage them to send standard output
to a file if they want to save excess standard output.  If desired,
the value can still be overwritten via the "output.limit" setting.

Fixes flux-framework#6256
chu11 added a commit to chu11/flux-core that referenced this issue Sep 9, 2024
Problem: The KVS has a size limit of INT_MAX for when returning kvs
values.  This limit can be exceeded by a job's standard output because
it is continually appended and the total size is not yet tracked by
the KVS.  When reading the output later, such as via `flux job attach`,
this can lead to EOVERFLOW errors.

Solution: For a single user instance, default to a maximum standard
output of 1G instead of "unlimited".  1G should provide a practical
maximum for most users and encourage them to send standard output to
a file if they want to save excess standard output.

Do not allow configuration larger than this.  As a consequence the
configuration of "unlimited" is no longer allowed.

Fixes flux-framework#6256
mergify bot closed this as completed in #6268 on Sep 9, 2024