Support update() with conditions #76

akudiyar · 2020-10-29T16:30:45Z

Use case: a customer wants to update a single field value in a large sharded space based on some conditions. For example, change the status of all orders from "PAYMENT_RECEIVED" to "CLOSED".

Proposed API variants:
a) Allow update() method to receive conditions instead of key parts:

    crud.update('test_space', {{'=', 'status', 'PAYMENT_RECEIVED'}}, {{'=', 'status', 'CLOSED'}})

b) Add a new method updateall() (the method name may be different):

    crud.updateall('test_space', {{'=', 'status', 'PAYMENT_RECEIVED'}}, {{'=', 'status', 'CLOSED'}})

The same variants may be used for the upsert() operation.

knazarov · 2020-10-29T18:58:40Z

I'm against extending the API with methods meant for bulk updates of whole spaces. Such a method can lock the whole instance for a long time.

akudiyar · 2020-10-29T19:07:07Z

Big selects carry the same risk, and the delete operation already supports conditions. It is a matter of providing a convenient API and not forcing customers to write boilerplate code.

Consider the case where we replace an SQL UPDATE .. WHERE operation with a CRUD analog. Should we load the data row by row from a large space to the client, or is updating it in place better?

knazarov · 2020-10-29T19:13:49Z

That's why we will impose the same limitation on selects that we have in Data Grid. There will be a limit on how many rows a select can scan before returning with an error. Unbounded selects shouldn't be allowed.

Yes, I believe it's better to pull data to the client and then update it in batches. It will keep the database operational at least. Doing an update that touches the whole data set will just freeze the database.

If you want to do an update of around 1-10 items, then pulling them to the client should not be a problem. If you want to update the whole table, then you've clearly picked the wrong way to implement it.
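A minimal sketch of the client-side approach described above: select a small batch of matching rows, then update each one by primary key. It relies on `crud.select` with the `first` option and on `crud.update` by key. The function name `update_in_batches`, the `max_batches` cap, and the assumption that the primary key is the first tuple field are hypothetical and only for illustration; `crud` is passed in as an argument so the sketch can be tried against a stub.

```lua
-- Sketch only: client-side "select, then update by primary key" in batches.
-- `crud` would normally be require('crud') on a router.
-- Assumption: the primary key is the first field of each tuple.
-- Assumption: applying `operations` makes a row stop matching `conditions`;
-- otherwise the same rows are selected again until `max_batches` is reached.
local function update_in_batches(crud, space, conditions, operations,
                                 batch_size, max_batches)
    local updated = 0
    for _ = 1, max_batches do
        -- Fetch at most `batch_size` rows that still match the conditions.
        local res, err = crud.select(space, conditions, {first = batch_size})
        if err ~= nil then
            return nil, err
        end
        if #res.rows == 0 then
            return updated
        end
        -- Update each selected row individually by its primary key.
        for _, row in ipairs(res.rows) do
            local _, update_err = crud.update(space, row[1], operations)
            if update_err ~= nil then
                return nil, update_err
            end
            updated = updated + 1
        end
    end
    return updated
end
```

For the example from the issue description, the call might look like `update_in_batches(require('crud'), 'test_space', {{'=', 'status', 'PAYMENT_RECEIVED'}}, {{'=', 'status', 'CLOSED'}}, 10, 1000)`. Each request stays small, at the cost of one extra round trip per row.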

knazarov · 2020-10-29T19:15:27Z

Updating whole datasets should be done by a background task.

akudiyar · 2020-10-29T19:24:57Z

If you mean an "asynchronous" task that reports events about its state, that is a good case and may be supported in the connectors. Users could then wait for the task to finish or not, depending on the business scenario. In that case, it seems like an implementation detail, and the update API may look like the one proposed above.

Btw, the delete() operation has the same API now.

akudiyar · 2020-10-29T19:28:25Z

> If you want to do an update of around 1-10 items, then pulling them to the client should not be a problem. If you want to update the whole table, then you've clearly picked the wrong way to implement it.

It looks like "select for update" and I am afraid it is less efficient than the built-in in-place update.

knazarov · 2020-10-29T19:36:36Z

The delete API doesn't have the same signature. It doesn't take conditions the way select does. You can only delete by primary key. And it's a deliberate choice.

I already told you that we won't support mass updates from the update function. It can block the event loop for a long time and we don't want that.

akudiyar · 2020-10-29T19:41:27Z

But what do you think about the mass updates in an asynchronous task which returns a future and produces events? Batch update using small portions of data may be implemented in CRUD this way.

knazarov · 2020-10-29T19:52:24Z

The current API is stateless and doesn't support long-running tasks. And the basic CRUD operations won't be stateful by design. This is a deliberate choice.

In the future, it is possible that we will make some sort of background tasks, but I will resist it unless there is a solid argument for why they are useful.

For now, use "select for update". It should solve most of your problems.

Totktonada · 2022-01-21T08:48:27Z

We discussed it with the product team and decided not to implement it here. Sorry, Alexey.

akudiyar mentioned this issue Apr 11, 2022

Support registering Lua API functions on storages and routers in runtime tarantool/cartridge#1799

Closed

Totktonada added the feature A new functionality label Jun 18, 2021

LeonidVas added the teamP label Dec 23, 2021

Totktonada closed this as completed Jan 21, 2022

Totktonada added wontfix This will not be worked on and removed teamP labels Jan 21, 2022
