
importccl: support default expressions #48253

Closed
pbardea opened this issue Apr 30, 2020 · 21 comments · Fixed by #50295
Labels
A-disaster-recovery C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)

Comments

pbardea (Contributor) commented Apr 30, 2020

It would be nice if IMPORT could support default column expressions. We probably want to plumb down an eval ctx and populate the defaults during row production.
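A minimal Go sketch of that idea (all types and names here are hypothetical, not CockroachDB's actual API): during row production, any column the input does not supply is filled by evaluating its default expression against an evaluation context plumbed down from the planner.

package main

import "fmt"

// evalCtx stands in for the evaluation context that would be plumbed down
// into the row converter; real default expressions would be evaluated
// against it.
type evalCtx struct{}

type column struct {
	name       string
	defaultVal func(*evalCtx) interface{} // nil means the column has no DEFAULT
}

// produceRow fills columns missing from the input with their defaults.
func produceRow(ctx *evalCtx, cols []column, supplied map[string]interface{}) []interface{} {
	row := make([]interface{}, len(cols))
	for i, c := range cols {
		if v, ok := supplied[c.name]; ok {
			row[i] = v
		} else if c.defaultVal != nil {
			row[i] = c.defaultVal(ctx)
		}
	}
	return row
}

func main() {
	cols := []column{
		{name: "a"},
		{name: "b", defaultVal: func(*evalCtx) interface{} { return 42 }},
	}
	// The input row only supplies column a; b falls back to its default.
	fmt.Println(produceRow(&evalCtx{}, cols, map[string]interface{}{"a": 1})) // [1 42]
}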

pbardea added the C-enhancement label Jun 11, 2020
pbardea (Contributor, Author) commented Jun 11, 2020

Expanding a bit on this here, I think in particular we want to support the following:

Given:
file.csv:

1,2
2,NULL

I think to start we should at least implement:

> IMPORT TABLE t1 (a INT, b INT DEFAULT 42) CSV DATA ('nodelocal://1/file.csv');
> SELECT * FROM t1;
  a |  b
----+-----
  1 |  2
  2 | 42
> CREATE TABLE t2 (a INT, b INT DEFAULT 42);
> IMPORT INTO t2 (a, b) CSV DATA ('nodelocal://1/file.csv');
> SELECT * FROM t2;
  a |  b
----+-----
  1 |  2
  2 | 42

I think it would also be reasonable to expect the following to work:

> CREATE TABLE t3 (a INT, b INT, c INT DEFAULT 42);
> IMPORT INTO t3 (a, b) CSV DATA ('nodelocal://1/file.csv');
> SELECT * FROM t3;
  a |  b   | c
----+------+-----
  1 |    2 | 42
  2 | NULL | 42

Anzoteh96 commented Jun 16, 2020

I wonder if replacing NULL in a targeted column with DEFAULT is the intended behavior for IMPORT, given that this doesn't seem to be the case when I do INSERT INTO:

> CREATE TABLE t1 (a INT, b INT DEFAULT 75)
> INSERT INTO t1 (a, b) VALUES (33, NULL)
> SELECT * FROM t1
  a |  b
----+------
 33 | NULL

vs the following (where b is non-targeted column)

> CREATE TABLE t1 (a INT, b INT DEFAULT 75)
> INSERT INTO t1 (a) VALUES (33)
> SELECT * FROM t1
  a |  b
----+-----
 33 | 75

pbardea (Contributor, Author) commented Jun 16, 2020

Yep - good point. Columns with defaults can indeed be nullable; the default is only populated when the column is not included in the insert statement.

craig bot closed this as completed in 9ac59f1 Jul 1, 2020
Anzoteh96 reopened this Jul 1, 2020
Anzoteh96 commented Jul 1, 2020

Reopened: the linked PR #50295 only handles constant default expressions. The next step is to support other expressions with Stable volatility, like now() (those whose values do not change within the same statement, but may change across statements). It might make sense to add support for unique_rowid() too.

Anzoteh96 pushed a commit to Anzoteh96/cockroach that referenced this issue Jul 14, 2020
like now(), localtimestamp, transaction_timestamp, current_date

This PR follows up on cockroachdb#50295 to add support for functions concerning
current timestamps, which include now(), localtimestamp(),
transaction_timestamp(), and current_date(). This is achieved by
injecting the walltime recorded in `importCtx` into the evalCtx required
to evaluate these functions.

Partially addresses cockroachdb#48253.

Release note (general change): timestamp functions are now supported
by IMPORT INTO.
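A rough Go illustration of the approach that commit describes, with hypothetical names: a single walltime is captured once for the import and injected into every evaluation context, so a Stable function like now() evaluates to the same value for every row of the job.

package main

import (
	"fmt"
	"time"
)

type evalCtx struct {
	stmtTimestamp time.Time // fixed walltime captured once for the whole import
}

// now mimics evaluating now() against the injected context: every call
// during the import returns the same captured timestamp.
func (e *evalCtx) now() time.Time { return e.stmtTimestamp }

func main() {
	ctx := &evalCtx{stmtTimestamp: time.Now()} // captured once, e.g. at planning

	// Rows evaluated at different wall-clock moments still agree on now().
	fmt.Println(ctx.now())
	time.Sleep(10 * time.Millisecond)
	fmt.Println(ctx.now()) // same value as above
}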
Anzoteh96 commented Aug 10, 2020

Strategy for nextval
Goals / desired properties (in order of priority, imho):

  1. To ensure that the nextval values are unique, even after including those that have been populated by INSERT INTO.
  2. To ensure that the nextval values are monotonically increasing (with respect to rows) within a file.
  3. Ideally, the nextval values in a file should be consecutive (i.e. +1 between adjacent rows).

The last point, however, is difficult (if not impractical) to implement, given that an IMPORT spanning multiple files / tables can process them simultaneously. So it could be relaxed to requiring minimal gaps between the numbers when sorted (e.g. if 51 and 55 are both populated, we want 52, 53, 54 to be populated somewhere in the files too, subject to the monotonicity constraint in (2)). Even so, the fact that inserts can happen during an IMPORT (into other tables that are not part of the same IMPORT job but possibly use the same sequence(s) as the tables being imported) makes this difficult, for the reasons described below. In addition, (2) is complicated by the fact that multiple processors work on the same import job simultaneously.

Strategy: from a table descriptor, we can extract the columns that use any sequence (via currval and nextval) and the corresponding sequences, from which we can determine at the import planning stage (or perhaps at the beginning of the import itself) which sequences we need to pay attention to. This means we can do the following (a sketch follows the list below):

  1. For each sequence, the main processor / coordinator determines the current value (that is, either the initial value or the value last handed out by nextval).
  2. Next, it supplies disjoint chunks of numbers to the processors (say there are 3 processors; then we could allocate numbers 1-1000, 1001-2000, 2001-3000 to processors 1, 2, 3, respectively). It also records / updates, internally, the last number allocated (3000 in the example above). This has to be done atomically to ensure that no other read or write to the sequence record (by, for example, other INSERTs using the same sequence(s) on other tables) interleaves with it.
  3. If a processor runs out of its slots, it can request more numbers (i.e. an additional chunk) on the fly from the coordinator, and the coordinator simply updates the last value allocated.

Notice that this is a modification of the previous strategy, under which, at the end of an IMPORT, each processor communicates the highest number it used, and the highest of those +1 becomes the next value for nextval. However, if an INSERT INTO (on another table using the same sequence) happens mid-import, it becomes tricky to keep subsequent uses of the sequence unique. Consider the following scenario:

  1. The chunks allocated are 1-1000, 1001-2000, 2001-3000 as per the example above.
  2. An insert to another table using the same sequence happens in the middle of the IMPORT, and let's say it uses values 3001-3010.
  3. The values last used by the 3 processors are 997, 1929, 2930.

The previous strategy would use 2931 as the next value for nextval, but then we risk repeating the values 3001-3010 already consumed by the insert (we could maintain a list / lookup to forbid 3001-3010 from being used again, but maintaining this list is more expensive than maintaining a single number indicating the next value for nextval).

Overall, this strategy should be safe to adopt as it maintains the uniqueness property; a drawback is that it could leave a huge gap between numbers (say we allocate 1000 numbers but only 10 rows are imported; that's a gap of 990 numbers).
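A minimal Go sketch of the proposed coordinator (names hypothetical; in the real system the atomicity requirement would be satisfied by a transaction against the sequence record, not an in-memory mutex):

package main

import (
	"fmt"
	"sync"
)

type coordinator struct {
	mu        sync.Mutex
	lastAlloc int64 // highest sequence value handed out so far
}

// allocateChunk atomically reserves the next `size` values and returns the
// inclusive range [first, last]; two processors can never receive
// overlapping ranges.
func (c *coordinator) allocateChunk(size int64) (first, last int64) {
	c.mu.Lock()
	defer c.mu.Unlock()
	first = c.lastAlloc + 1
	c.lastAlloc += size
	return first, c.lastAlloc
}

func main() {
	c := &coordinator{}
	for p := 1; p <= 3; p++ {
		first, last := c.allocateChunk(1000)
		fmt.Printf("processor %d: %d-%d\n", p, first, last)
	}
	// processor 1: 1-1000, processor 2: 1001-2000, processor 3: 2001-3000
}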


Separate from this: once nextval is supported, currval should also be supported, since we're basically reading the most recently used value off the lookup map. One scenario that could make this tricky is when we have nextval() and currval() simultaneously:

CREATE SEQUENCE myseq; -- fresh sequence, no nextval calls yet
CREATE TABLE t (a INT, b INT DEFAULT nextval('myseq'), c INT DEFAULT currval('myseq'));
IMPORT INTO t (blah)

In this case, which currval are we using? The value right before the IMPORT starts, or the most up-to-date value of myseq as it is updated via nextval? The latter could be tricky: imagine we have something as crazy as

nextval('myseq') + currval('myseq') * nextval('myseq')

as the default expression. Then it's not even clear what the value of currval should be.

dt (Member) commented Aug 10, 2020

This sounds about right, but I don't know if we need the main proc / coordinator in the picture at all. When a given processor runs out of IDs, it needs to reserve more, and, as you say, that reservation needs to be done atomically from the point of view of any other traffic to the sequence unrelated to the IMPORT, so we'll use a txn. If we have a txn, we don't really need to go back to the coordinator at all: if multiple processors need a chunk at the same time, the txn will serialize them on its own, and that avoids a lot of mess trying to do two-way data flow / RPCs between the processors and the coordinator.

The other thing I'd do is start with a smaller chunk size -- e.g. reserve 10 initially, then on each subsequent reservation go back for an order of magnitude more, until we hit, say, 100k (see the sketch below). That way small tables of 5 values don't blow a 100k hole in the sequence, but big tables don't spend all their time contending on sequence reservations.
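A small Go sketch of that chunk-size schedule, assuming a starting size of 10 and a cap of 100k:

package main

import "fmt"

// nextChunkSize grows the reservation by an order of magnitude per request,
// starting at 10 and capping at 100k.
func nextChunkSize(prev int64) int64 {
	const initial, cap = 10, 100_000
	if prev == 0 {
		return initial
	}
	if next := prev * 10; next < cap {
		return next
	}
	return cap
}

func main() {
	size := int64(0)
	for i := 0; i < 6; i++ {
		size = nextChunkSize(size)
		fmt.Println(size) // 10, 100, 1000, 10000, 100000, 100000
	}
}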

maddyblue (Contributor) commented
If you have a txn during an IMPORT, won't that block any INSERTs from happening at the same time? Is that OK? Also, unless it's changed, sequence increments happen in their own internal txn such that they are atomic but not transactional (with respect to any user-created transaction). That is, an aborted transaction that had an INSERT would still result in an incremented sequence.

maddyblue (Contributor) commented
Oh, it's for the above reason (atomic but not transactional sequence increments) that I'm not sure why you need that property during IMPORT. Users already can't have transactional sequences, and INSERT IDs can be created out of order (for example, multiple INSERT transactions can finish in one order and have a different order for their sequence IDs). Why do IMPORTs need that property?

dt (Member) commented Aug 10, 2020

@mjibson I'm not sure I follow. What property?

The plan is that when an IMPORT processor hits a row that needs a nextval, it'll use a txn to atomically a) update the sequence to take a chunk of IDs and b) record the file and line number at which that chunk was taken in the job progress. Then on resume, if we previously reserved a chunk for a given line, we'll use it. That is an important property of IMPORT: it ensures that every time you re-run the same IMPORT (i.e. resume the same job), you produce the same KVs, so if you resume after a pause or crash, your uncheckpointed prior work is perfectly shadowed by your re-done version. If you abort the IMPORT, the "holes" are still in the sequence keyspace -- we updated it in a short-lived txn that committed on the first resume, regardless of what happens later.
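A toy Go sketch of that resume behavior (types hypothetical; in the design above the sequence update and the progress write share one kv txn, and the map lives in the job record): re-running a row after a crash reuses the chunk reserved on the first attempt, so it produces identical KVs.

package main

import "fmt"

type rowKey struct{ fileIdx, rowNum int64 }

type chunk struct{ first, last int64 } // inclusive range of reserved values

type jobProgress struct {
	reserved map[rowKey]chunk // persisted in the job record in the real design
	seqVal   int64            // stand-in for the sequence's stored value
}

// chunkFor returns the chunk tied to (fileIdx, rowNum), reserving a new one
// only if no previous attempt already did.
func (p *jobProgress) chunkFor(k rowKey, size int64) chunk {
	if c, ok := p.reserved[k]; ok {
		return c // resume path: reuse the chunk from the first attempt
	}
	// First attempt: bump the sequence and record the chunk. In the real
	// design both writes happen atomically in one txn.
	c := chunk{first: p.seqVal + 1, last: p.seqVal + size}
	p.seqVal += size
	p.reserved[k] = c
	return c
}

func main() {
	p := &jobProgress{reserved: map[rowKey]chunk{}}
	k := rowKey{fileIdx: 0, rowNum: 30}
	fmt.Println(p.chunkFor(k, 10)) // first run reserves {1 10}
	fmt.Println(p.chunkFor(k, 10)) // resumed run gets the same {1 10}
}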

maddyblue (Contributor) commented
I was referring to (3) in the earlier comment:

  1. Ideally, the nextval values in a file should be consecutive (i.e. +1 between adjacent rows).

dt (Member) commented Aug 10, 2020

Ah, we do sort of want that property: if the values are used in keys, we'd like adjacent input rows to become adjacent KVs for ingestion performance. And simply incrementing within our reserved chunk is also the easiest way to know when we've exhausted the chunk and need to get another one. But we don't make any effort to ensure that sequential chunks of the same file have sequential IDs.

Anzoteh96 commented
Following the consideration of "determinism" (that is, on resume we produce the same KVs), an additional thing we'll need to do is include both the next value to be used and the chunk limit (together with the resume position) when saving progress, as in the sketch below.
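For illustration, the per-processor checkpoint could carry fields like these (a Go sketch with hypothetical names, not the actual job-progress proto):

package importprogress

// processorCheckpoint sketches what a processor would persist: saving
// nextVal and chunkMax alongside the resume position lets a restarted
// processor hand out exactly the values it would have used before the
// failure.
type processorCheckpoint struct {
	resumePos int64 // row at which to resume the input file
	nextVal   int64 // next sequence value this processor will emit
	chunkMax  int64 // inclusive upper bound of the current reserved chunk
}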

Anzoteh96 commented
Here's one tricky scenario that I can think of:

Imagine we saved progress at row 30 (say), with last used value 1024 and chunk limit 1100.

  1. Let's also assume that after we save that, we use up the remaining slots and reach our limit, ending up asking for more. Let's also assume that the next chunk is 1401-1500.
  2. If the processor has to resume (after a failure, say), we'll resume at 1024 and reach the chunk limit 1100. But then it will have to remember that 1401-1500 belongs to this processor. Should we consider caching the chunks that a processor has asked for?

Separate from this, I concur with the idea of starting with a smaller chunk size and increasing this chunk size as we progress.

pbardea (Contributor, Author) commented Aug 11, 2020

But then it will have to remember that 1401-1500 belongs to this processor.

We shouldn't need to remember which processor requested which chunk. All we need to know is which file and line number corresponds to which chunk. These (file num, row num, reserved-chunk) entries should be encoded in the import job details in the same txn that reserves the chunk. On resume, the processor should look up whether a chunk was already reserved in a previous attempt.

mwang1026 commented
Closing this. We didn't get to nextval(sequence), so I'll create separate tracking for it. Awesome stuff @Anzoteh96!

adityamaru added a commit to adityamaru/cockroach that referenced this issue Nov 10, 2020
Previously, `nextval` was not supported as a default expression for a
non-targeted column in IMPORT INTO. This change adds that functionality
for CSV imports.

There is a lot of great discussion about the approach to this problem at
cockroachdb#48253 (comment).

At a high level, on encountering a nextval(seqname) for the first time,
IMPORT will reserve a chunk of values for this sequence and tie those
values to the (fileIdx, rowNum) pair, which uniquely references a
particular row in a distributed import. The size of this chunk grows
exponentially based on how many times a single processor encounters a
nextval call for that particular sequence. The chunk reservation
piggybacks on existing methods that provide atomic, non-transactional
guarantees when increasing the value of a sequence.

Information about the reserved chunks is stored in the import job
progress details so as to ensure the following property:

If the import job is paused and then resumed, and the rows imported were
not checkpointed, we need to ensure that the nextval value for a
previously processed (fileIdx, rowNum) is identical to the value
computed in the first run of the import job. This property is necessary
to prevent duplicate entries with the same key but different values. We
use the job progress details to check whether we have a previously
reserved chunk of sequence values that can be used for the current
(fileIdx, rowNum).

Release note (sql change): IMPORT INTO for CSV now supports nextval as a
default expression of a non-targeted column.
craig bot pushed a commit that referenced this issue Dec 21, 2020
56473: importccl: add `nextval` support for IMPORT INTO CSV r=miretskiy,pbardea a=adityamaru

Informs: #54797

Release note (sql change): IMPORT INTO for CSV now supports nextval as a
default expression of a non-targeted column.

Co-authored-by: Aditya Maru <[email protected]>