KVS: need flux_kvs_move() or flux_kvs_txn_move() #1747

garlick · 2018-10-22T13:35:56Z

The job manager needs a way to move a job from the active to inactive directories. Currently that would be a combination of a snapshot lookup, a put to the new key, and an unlink of the old key. An example exists in cmd_move() in the flux-kvs command.

The job manager could simply chain futures for these operations (lookup as one RPC, put + unlink + commit as a second one), but I thought I'd open this issue to consider whether it's a useful function to expose in the API, and which way to implement is best:

as a composite future (e.g. hiding lookup + commit inside)
as a native operation that could be combined with other operations in a transaction

I think 2) would be most useful, but I'm not sure how difficult it would be to implement on the KVS module side.

The text was updated successfully, but these errors were encountered:

garlick · 2018-10-22T15:04:26Z

On 2), I guess it would be two native operations that could appear sequentially in a transaction to represent a move: something like link to crate a new reference from the namespace, followed by the existing unlink on the old reference.

This link is a bit different from UNIX link(2) where you get two names for the same dynamic content. Here the links are just duplicate hash references, so one can change going forward without affecting the other. Maybe that argues for a different name like snapshot?

If link / snapshot could be allowed to cross namespaces, it could also be used for "reaping" a private namespace when a job is made inactive.

chu11 · 2018-10-22T18:11:05Z

If I'm thinking this through correctly a link / snapshot is still just a lookup & put in a transaction, and the move transaction would be the link / snapshot & an unlink.

We already can unlink in a transaction, so that's done. We have mechanisms in place for kvs transactions to lookup missing references (i.e. the root dirref is not yet loaded locally). In the past, we only had to do lookups for missing references within the "tree" of what we were modifying/changing. But now we would have to lookup missing references just to figure out what we need to link to. So it does slow transactions down by adding this extra lookups in the middle. I don't think it's impossible to add, would just be a little extra work plus have to carefully figure out corner cases (i.e. link a val vs a valref vs a symlink).

garlick · 2018-10-22T18:35:32Z

Could it just always copy a directory entry as is, without checking type? Seems like that would work in all cases.

And actually maybe copy is the right name for the new operation?

chu11 · 2018-10-22T18:55:12Z

Could it just always copy a directory entry as is, without checking type? Seems like that would work in all cases.

Ahh you're right, can just copy the entry.

And actually maybe copy is the right name for the new operation?

If we want support a native copy, then we probably have to consider both shallow & deep copies. The latter being more expensive obviously and probably not what we'd want to use in a move operation.

garlick · 2018-10-22T19:07:49Z

If we want support a native copy, then we probably have to consider both shallow & deep copies. The latter being more expensive obviously and probably not what we'd want to use in a move operation.

I think actually copying a dirent of any type is equivalent to a deep copy. See cmd_copy() and cmd_move() in flux-kvs.c. Or am I misunderstanding what you mean by deep copy?

chu11 · 2018-10-22T20:56:46Z

I think actually copying a dirent of any type is equivalent to a deep copy.

Right now in treeobj.c shallow and deep copies are different for treeobj dir objects.

garlick · 2018-10-22T21:00:40Z

Oh sorry I might have a shallow (so to speak) understanding of commit processing at this point. I just meant the end result of copying a dirent (shallow from the copy_treeobj() point of view), would be sufficient to make it look like all its contents have been copied.

chu11 · 2018-10-22T22:44:56Z

I see now, a shallow copy is sufficient b/c it is "deep" from the user perspective looking at the KVS. I'm sure I got confused too :-)

chu11 · 2018-10-22T23:19:20Z

It just occurred to me, having a native copy or move in the KVS module might be difficult given the potential to cross namespaces. For example, doing a flux kvs move ns:FOO/foo ns:BAR/bar

Right now we have a rule that kvs txns can only occur within a single namespace.

garlick · 2018-10-23T02:26:43Z

Well, we could stick to the restriction. Would that make it more doable?

chu11 · 2018-10-23T16:22:31Z

it definitely makes it more doable. Effectively I have to add a lookup() in the middle of the normal kvs transaction processing and I think it can be done semi-easily. After that it's two kvs modifications instead of one (a "put" followed by a "unlink"), but the code is basically all there already.

Other main thing is right now all kvs transactions are effectively "puts" (regardless if it's a put of a val or dir, that's all encoded in the treeobj). So we'd have to add a "transaction type" kind of field to each transaction operation, so the kvs module can determine it's a "put", "copy", "move" or what not. And obviously for "copy" / "move" we need to add an additional key field for the destination. I don't think the transactions are in any RFC so I don't think we need to update an RFC for this, although maybe we should?

chu11 · 2018-10-23T17:46:29Z

it's probably also worth mentioning that we would want flux kvs move and flux kvs copy to stick with its current implementation, since we do not want it to be restricted to a single namespace. A flux_kvs_txn_copy/move() would only for situations we want to stick a bunch of copies/moves in with other transactions.

garlick · 2018-10-23T17:54:58Z

it definitely makes it more doable. Effectively I have to add a lookup() in the middle of the normal kvs transaction processing and I think it can be done semi-easily. After that it's two kvs modifications instead of one (a "put" followed by a "unlink"), but the code is basically all there already.

Actually I think since an unlink could be added to a transaction externally, we'd just want to support "copy" (user calls flux_kvs_txn_copy() then flux_kvs_txn_unlink() then commits once).

It seems like this is not that easy to get done within the KVS module, and maybe we should just go with the composite future implementation instead (implementing it within libkvs)? At least for now?

garlick · 2018-11-05T01:13:30Z

@chu11 were you already working on the libkvs implementations of move and copy?

I was playing around with the composite futures and got something working which I'm fine with tossing out if your'e already going on this. LMK, otherwise I'll go ahead and do the work to turn my toy into something better.

chu11 · 2018-11-05T16:33:34Z

haven't started yet as I've been working on #1651.

Add flux_kvs_copy() and flux_kvs_move() based on composite futures. flux_kvs_copy() performs: 1) a lookup of the directory entry for the source key 2) a put, commit that sets the destination key to a copy of the dirent. kvs_move() performs: 1) a flux_kvs_copy() 2) an unlink, commit on the source key Fixes flux-framework#1747

chu11 mentioned this issue Oct 23, 2018

kvs: convert missing_refs_list to hash or deal with duplicates #1751

Closed

chu11 self-assigned this Oct 23, 2018

chu11 removed their assignment Oct 24, 2018

garlick mentioned this issue Nov 10, 2018

kvs: add flux_kvs_copy(), flux_kvs_move() #1828

Merged

garlick self-assigned this Nov 11, 2018

chu11 closed this as completed in 31a739e Nov 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KVS: need flux_kvs_move() or flux_kvs_txn_move() #1747

KVS: need flux_kvs_move() or flux_kvs_txn_move() #1747

garlick commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 23, 2018

chu11 commented Oct 23, 2018

chu11 commented Oct 23, 2018

garlick commented Oct 23, 2018

garlick commented Nov 5, 2018

chu11 commented Nov 5, 2018

KVS: need flux_kvs_move() or flux_kvs_txn_move() #1747

KVS: need flux_kvs_move() or flux_kvs_txn_move() #1747

Comments

garlick commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 22, 2018

chu11 commented Oct 22, 2018

chu11 commented Oct 22, 2018

garlick commented Oct 23, 2018

chu11 commented Oct 23, 2018

chu11 commented Oct 23, 2018

garlick commented Oct 23, 2018

garlick commented Nov 5, 2018

chu11 commented Nov 5, 2018