kuzudb · prrao87 · Nov 15, 2024 · Oct 18, 2024 · Nov 5, 2024 · Nov 6, 2024
diff --git a/public/img/cli/highlighting.png b/public/img/cli/highlighting.png
diff --git a/public/img/progress-bar.gif → public/img/cli/progress-bar.gif b/public/img/progress-bar.gif → public/img/cli/progress-bar.gif
diff --git a/src/content/docs/client-apis/cli.mdx b/src/content/docs/client-apis/cli.mdx
@@ -81,6 +81,10 @@ kuzu> :help
     :max_width [max_width]     set maximum width in characters for display
     :mode [mode]     set output mode (default: box)
     :stats [on|off]     toggle query stats on or off
+    :multiline     set multiline mode (default)
+    :singleline     set singleline mode
+    :highlight [on|off]     toggle syntax highlighting on or off
+    :render_errors [on|off]     toggle error highlighting on or off
 
     Note: you can change and see several system configurations, such as num-threads, 
           timeout, and progress_bar using Cypher CALL statements.
@@ -106,6 +110,46 @@ Set output mode. The default mode is `box`. See the output modes section below f
 #### `:stats [on|off]`
 Toggle query statistics on or off. The default is `on`. Query statistics include the number of tuples, columns, and execution time.
 
+#### `:multiline`
+Set multiline editing mode. This is the default editing mode. In multiline editing mode, you can write queries that span multiple lines. In this mode, you are able to go back
+to previous lines and edit them. Comments and newlines will stay when saved into history when using this mode. 
+
+```bash
+kuzu> :multiline
+Multi line mode enabled
+kuzu> CREATE NODE TABLE
+    · Person(name STRING,
+    · age INT64,
+    ‣ PRIMARY KEY (name) // a comment here too
+    · );
+```
+
+The `‣` symbol indicates the current line.
+
+#### `:singleline`
+Set singleline editing mode. In singleline editing mode, you can only write queries on a single line.
+If your query spans multiple lines, you will not be able to go back to previous lines and edit them. Single-line comments and newlines
+will not be saved into history when using this mode.
+
+```bash
+kuzu> :singleline
+Single line mode enabled
+kuzu> CREATE NODE TABLE
+..> Person(name STRING,
+..> age INT64,
+..> PRIMARY KEY (name) // a comment here too
+..> );
+```
+
+#### `:highlight [on|off]`
+Toggle syntax highlighting on or off. The default is `on`. When enabled, the shell will highlight Cypher keywords, 
+constants and literals, syntax errors, and comments. Error highlighting and multiline comment highlighting are not available in singleline mode.
+![](/img/cli/highlighting.png)
+
+#### `:render_errors [on|off]`
+Toggle error highlighting on or off. The default is `on`. When enabled, the shell will highlight syntax errors in red. In particular, 
+mismatched brackets and unclosed quotes will be highlighted. Error highlighting is not available in singleline mode.
+
 ## Interrupt shell
 
 To interrupt a running query, use `Ctrl + C` in CLI. Note: We currently don't support interrupting a running `COPY` statement.
@@ -156,7 +200,7 @@ the number of pipelines that have been executed (each query is broken down into
 as well as the percentage of the data processed in a pipeline. This gives an estimate for how much of a pipeline
 has executed.
 
-![](/img/progress-bar.gif)
+![](/img/cli/progress-bar.gif)
 
 The progress bar is not enabled by default. To enable the progress bar, use the following command:
 
@@ -198,4 +242,11 @@ kuzu> CREATE NODE TABLE Person (name STRING, age INT64, PRIMARY KEY(name));
 ```
 
 The `:max_rows` and `:max_width` commands can be used to control the number of rows and the width
-of the `box`, `column`, `table`, and `markdown` output modes.
+of the `box`, `column`, `table`, and `markdown` output modes.
+
+## Multi-line Cypher statements
+The CLI supports queries written in multiple lines. If a semicolon is omitted, hitting enter will allow users to continue the query in a newline instead of executing it.
+```
+kuzu> MATCH (a:person)
+    ‣ RETURN a.id;
+```
diff --git a/src/content/docs/client-apis/python.mdx b/src/content/docs/client-apis/python.mdx
@@ -377,20 +377,6 @@ types to a Kùzu `LogicalTypeID`, which will be used to infer types via Python t
 |`list`|`LIST`|
 |`dict`|`MAP`|
 
-### Nested types
-
-When defining a UDF, you can also specify nested types, though in this case, there are some differences
-from the example shown above.
-
-If the parameter is a nested type, you must also provide the children's type information. As such, with nested types,
-it's not valid to use `kuzu.Type`. Instead, a string representation of the type should be given.
-
-- A list of `INT64` would be `"INT64[]"`
-- A map from a `STRING` to a `DOUBLE` would be `"MAP(STRING, DOUBLE)"`.
-
-Note that it's also valid to define child types through Python's type annotations, e.g. `list[int]`,
-or `dict(str, float)`. It is also valid to use string representations to denote non-nested types.
-
 ## UDF
 
 Kùzu's Python API also supports the registration of User Defined Functions (UDFs),
@@ -490,3 +476,44 @@ In case you want to remove the UDF, you can call the `remove_function` method on
 # Use existing connection object
 conn.remove_function(difference)
 ```
+
+### Nested and complex types
+
+When working with UDFs, you can also specify nested or complex types, though in this case, there are some differences
+from the examples shown above. With these additional types, a string representation should be given
+for the parameters which are then manually cast to the respective Kùzu type.
+
+Some examples of where this is relevant are listed below:
+
+- A list of `INT64` would be `"INT64[]"`
+- A map from a `STRING` to a `DOUBLE` would be `"MAP(STRING, DOUBLE)"`
+- A Decimal value with 7 significant figures and 2 decimals would be `"DECIMAL(7, 2)"`
+
+Note that it's also valid to define child types through Python's type annotations, e.g. `list[int]`,
+or `dict(str, float)` for simple types.
+
+Below, we show an example to calculate the discounted price of an item using a Python UDF.
+
+```python
+def calculate_discounted_price(price: float, has_discount: bool) -> float:
+    # Assume 10% discount on all items for simplicity
+    return float(price) * 0.9 if has_discount else price
+
+# define the expected type of the UDF's parameters
+parameters = ['DECIMAL(7, 2)', kuzu.Type.BOOL]
+
+# define expected type of the UDF's returned value
+return_type = 'DECIMAL(7, 2)'
+
+# register the UDF
+conn.create_function(
+    "current_price",
+    calculate_discounted_price,
+    parameters,
+    return_type
+)
+```
+
+The second parameter is a built-in native type in Kùzu, i.e., `kuzu.Type.BOOL`. For the first parameter,
+we need to specify a string, i.e. `"DECIMAL(7,2)"` that's then parsed and used by the binder in Kùzu
+to map to the internal Decimal representation.
diff --git a/src/content/docs/cypher/configuration.md b/src/content/docs/cypher/configuration.md
@@ -8,17 +8,19 @@ statement, described in this section. Different from [the `CALL` clause](/cypher
 configuration **cannot** be used with other query clauses, such as `RETURN`.
 
 ### Connection configuration
-| Option | Description                                                                    | Default                |
-| ----------- |--------------------------------------------------------------------------------|------------------------|
-| `THREADS` | number of threads used by execution                                            | system maximum threads |
-| `TIMEOUT` | timeout of query execution in ms                                               | N/A                    |
-| `VAR_LENGTH_EXTEND_MAX_DEPTH` | maximum depth of recursive extend                                              | 30                     |
-| `ENABLE_SEMI_MASK` | enables the semi mask optimization                                             | true                   |
-| `HOME_DIRECTORY`| system home directory                                                          | user home directory    |
-| `FILE_SEARCH_PATH`| file search path                                                               | N/A                    |
-| `PROGRESS_BAR` | enable progress bar in CLI                                                     | false                  |
-| `PROGRESS_BAR_TIME` | show progress bar after time in ms                                             | 1000                   |
-| `CHECKPOINT_THRESHOLD` | the WAL size threshold in bytes at which to automatically trigger a checkpoint | 16777216 (16MB)        |
+| Option | Description                                                                                                                                                                                                                                                               | Default                |
+| ----------- |---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------|
+| `THREADS` | number of threads used by execution                                                                                                                                                                                                                                       | system maximum threads |
+| `TIMEOUT` | timeout of query execution in ms                                                                                                                                                                                                                                          | N/A                    |
+| `VAR_LENGTH_EXTEND_MAX_DEPTH` | maximum depth of recursive extend                                                                                                                                                                                                                                         | 30                     |
+| `ENABLE_SEMI_MASK` | enables the semi mask optimization                                                                                                                                                                                                                                        | true                   |
+| `HOME_DIRECTORY`| system home directory                                                                                                                                                                                                                                                     | user home directory    |
+| `FILE_SEARCH_PATH`| file search path                                                                                                                                                                                                                                                          | N/A                    |
+| `PROGRESS_BAR` | enable progress bar in CLI                                                                                                                                                                                                                                                | false                  |
+| `PROGRESS_BAR_TIME` | show progress bar after time in ms                                                                                                                                                                                                                                        | 1000                   |
+| `CHECKPOINT_THRESHOLD` | the WAL size threshold in bytes at which to automatically trigger a checkpoint                                                                                                                                                                                            | 16777216 (16MB)        |
+| `WARNING_LIMIT` | Maximum number of warnings that can be stored in a single connection. Currently only the warnings related to [malformed CSV lines](/import/csv#ignoring-erroneous-rows) are stored if `ignore_errors` parameter is set to true in `COPY FROM` and `LOAD FROM` statements. | 8192        |
+| `SPILL_TO_DISK_TMP_FILE` | The location of the temporary file to use to store data if there is not enough memory during a copy                                                                                                                                                                       | `copy.tmp` inside the database directory |
 
 ### Database configuration
 | Option | Description | Default |
@@ -67,4 +69,16 @@ CALL progress_bar=true;
 #### Configure checkpoint threshold
 ```cypher
 CALL checkpoint_threshold=16777216;
-```
+```
+
+#### Configure warning limit
+```cypher
+CALL warning_limit=1024;
+```
+
+#### Configure Spill to disk temporary file
+```cypher
+CALL spill_to_disk_tmp_file="/path/to/tmp/file";
+# Disables spilling to disk
+CALL spill_to_disk_tmp_file="";
+```
diff --git a/src/content/docs/cypher/data-definition/create-table.md b/src/content/docs/cypher/data-definition/create-table.md
@@ -38,7 +38,13 @@ To create a node table, use the `CREATE NODE TABLE` statement as shown below:
 ```sql
 CREATE NODE TABLE User (name STRING, age INT64 DEFAULT 0, reg_date DATE, PRIMARY KEY (name))
 ```
-The above statement adds a `User` table to the catalog of the system with three properties: `name`, `age`, and `reg_date`,
+
+Alternatively, you can specify the keyword `PRIMARY KEY` immediately after the column name, as follows:
+```sql
+CREATE NODE TABLE User (name STRING PRIMARY KEY, age INT64 DEFAULT 0, reg_date DATE)
+```
+
+The above statements adds a `User` table to the catalog of the system with three properties: `name`, `age`, and `reg_date`,
 with the primary key being set to the `name` property in this case.
 
 The name of the node table, `User`, specified above will serve as the "label" which we want to query
@@ -49,7 +55,7 @@ MATCH (a:User) RETURN *
 
 ### Primary key
 
-Kùzu requires a primary key column for node table which can be either a `STRING` or `INT64` property of the node. Kùzu will generate an index to do quick lookups on the primary key (e.g., `name` in the above example). Alternatively, you can use the [`SERIAL`](/cypher/data-types/#serial) data type to generate an auto-increment column as primary key.
+Kùzu requires a primary key column for node table which can be either a `STRING`, numeric, `DATE`, or `BLOB` property of the node. Kùzu will generate an index to do quick lookups on the primary key (e.g., `name` in the above example). Alternatively, you can use the [`SERIAL`](/cypher/data-types/#serial) data type to generate an auto-increment column as primary key.
 
 ### Default value
 

diff --git a/src/content/docs/cypher/data-manipulation-clauses/example-database.md b/src/content/docs/cypher/data-manipulation-clauses/example-database.md
@@ -15,7 +15,7 @@ are shown below.
 ### User nodes
 Schema:
 ```cypher
-CREATE NODE TABLE User(name STRING, age INT64, PRIMARY KEY (name))
+CREATE NODE TABLE User(name STRING, age INT64 DEFAULT 0, PRIMARY KEY (name))
 ```
 
 user.csv: