Skip to content

Commit

Permalink
wallet/db_sqlite3.c: Support direct replication of SQLITE3 backends.
Browse files Browse the repository at this point in the history
ChangeLog-Added: With the `sqlite3://` scheme for `--wallet` option, you can now specify a second file path for real-time database backup by separating it from the main file path with a `:` character.
  • Loading branch information
ZmnSCPxj committed Oct 27, 2021
1 parent 091a6d9 commit b4198cc
Show file tree
Hide file tree
Showing 4 changed files with 228 additions and 15 deletions.
61 changes: 61 additions & 0 deletions doc/BACKUP.md
Original file line number Diff line number Diff line change
Expand Up @@ -82,6 +82,67 @@ any in-channel funds.
To recover in-channel funds, you need to use one or more of the other
backup strategies below.

## SQLITE3 `--wallet` `:` And Remote NFS Mount

`/!\` WHO SHOULD DO THIS: Casual users.

`/!\` **CAUTION** `/!\` This technique is only supported on 0.10.3
or later.
On earlier versions, the `:` character is not special and will be
considered part of the path of the database file.

When using the SQLITE3 backend (the default), you can specify a
second database file to replicate to, by separating the second
file with a single `:` character in the `--wallet` option, after
the main database filename.

For example, if the user running `lightningd` is named `user`, and
you are on the Bitcoin mainnet with the default `${LIGHTNINGDIR}`, you
can specify in your `config` file:

wallet=sqlite3:///home/user/.lightning/bitcoin/lightningd.sqlite3:/my/backup/lightningd.sqlite3

Or via command line:

lightningd --wallet=sqlite3:///home/user/.lightning/bitcoin/lightningd.sqlite3:/my/backup/lightningd.sqlite3

If the second database file does not exist but the directory that would
contain it does exist, the file is created.
If the directory of the second database file does not exist, `lightningd` will
fail at startup.
If the second database file already exists, on startup it will be overwritten
with the main database.
During operation, all database updates will be done on both databases.

This has the advantage compared to the `backup` plugin below of requiring
exactly the same amount of space on both the main and backup storage.
The `backup` plugin will take more space on the backup than on the main
storage.
It has the disadvantage that it will only work with the SQLITE3 backend and
is not supported by the PostgreSQL backend, and is unlikely to be supported
on any future database backends.

You can only specify *one* replica.

It is recommended that you use a network-mounted filesystem for the backup
destination.
For example, if you have a NAS you can access remotely.
At the minimum, set the backup to a different storage device.

Do note that files are not stored encrypted, so you should really not do
this with rented space ("cloud storage").

To recover, simply get the second database file, which is a replica of the
main database file.

If your backup destination is a network-mounted filesystem that is in a
remote location, then even loss of all hardware in one location will allow
you to still recover your Lightning funds.

However, if instead you are just replicating the database on another
storage device in a single location, you remain vulnerable to disasters
like fire or computer confiscation.

## `backup` Plugin And Remote NFS Mount

`/!\` WHO SHOULD DO THIS: Casual users.
Expand Down
7 changes: 7 additions & 0 deletions doc/lightningd-config.5.md
Original file line number Diff line number Diff line change
Expand Up @@ -193,6 +193,13 @@ The default wallet corresponds to the following DSN:
--wallet=sqlite3://$HOME/.lightning/bitcoin/lightningd.sqlite3
```

For the `sqlite3` scheme, you can specify a single backup database file
by separating it with a `:` character, like so:

```
--wallet=sqlite3://$HOME/.lightning/bitcoin/lightningd.sqlite3:/backup/lightningd.sqlite3
```

The following is an example of a postgresql wallet DSN:

```
Expand Down
34 changes: 34 additions & 0 deletions tests/test_db.py
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@
import base64
import os
import pytest
import shutil
import time
import unittest

Expand Down Expand Up @@ -379,3 +380,36 @@ def test_local_basepoints_cache(bitcoind, node_factory):
# after we verified.
l1.restart()
l2.restart()


@unittest.skipIf(os.getenv('TEST_DB_PROVIDER', 'sqlite3') != 'sqlite3', "Tests a feature unique to SQLITE3 backend")
def test_sqlite3_builtin_backup(bitcoind, node_factory):
l1 = node_factory.get_node(start=False)

# Figure out the path to the actual db.
main_db_file = l1.db.path
# Create a backup copy in the same location with the suffix .bak
backup_db_file = main_db_file + ".bak"

# Provide the --wallet option and start.
l1.daemon.opts['wallet'] = "sqlite3://" + main_db_file + ':' + backup_db_file
l1.start()

# Get an address and put some funds.
addr = l1.rpc.newaddr()['bech32']
bitcoind.rpc.sendtoaddress(addr, 1)
bitcoind.generate_block(1)
wait_for(lambda: len(l1.rpc.listfunds()['outputs']) == 1)

# Stop the node.
l1.stop()

# Copy the backup over the main db file.
shutil.copyfile(backup_db_file, main_db_file)

# Remove the --wallet option and start.
del l1.daemon.opts['wallet']
l1.start()

# Should still see the funds.
assert(len(l1.rpc.listfunds()['outputs']) == 1)
141 changes: 126 additions & 15 deletions wallet/db_sqlite3.c
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,27 @@
#if HAVE_SQLITE3
#include <sqlite3.h>

struct db_sqlite3 {
/* The actual db connection. */
sqlite3 *conn;
/* A replica db connection, if requested, or NULL otherwise. */
sqlite3 *backup_conn;
/* The backup object for the replica db connection. */
sqlite3_backup *backup;
};

/**
* @param conn: The db->conn void * pointer.
*
* @return the actual sqlite3 connection.
*/
static inline
sqlite3 *conn2sql(void *conn)
{
struct db_sqlite3 *wrapper = (struct db_sqlite3 *) conn;
return wrapper->conn;
}

#if !HAVE_SQLITE3_EXPANDED_SQL
/* Prior to sqlite3 v3.14, we have to use tracing to dump statements */
static void trace_sqlite3(void *stmtv, const char *stmt)
Expand All @@ -17,39 +38,85 @@ static void trace_sqlite3(void *stmtv, const char *stmt)
static const char *db_sqlite3_fmt_error(struct db_stmt *stmt)
{
return tal_fmt(stmt, "%s: %s: %s", stmt->location, stmt->query->query,
sqlite3_errmsg(stmt->db->conn));
sqlite3_errmsg(conn2sql(stmt->db->conn)));
}

static bool db_sqlite3_setup(struct db *db)
{
char *filename;
char *sep;
char *backup_filename = NULL;
sqlite3_stmt *stmt;
sqlite3 *sql;
int err, flags = SQLITE_OPEN_READWRITE | SQLITE_OPEN_CREATE;

struct db_sqlite3 *wrapper;

if (!strstarts(db->filename, "sqlite3://") || strlen(db->filename) < 10)
db_fatal("Could not parse the wallet DSN: %s", db->filename);

/* Strip the scheme from the dsn. */
filename = db->filename + strlen("sqlite3://");
/* Look for a replica specification. */
sep = strchr(filename, ':');
if (sep) {
/* Split at ':'. */
filename = tal_strndup(db, filename, sep - filename);
backup_filename = tal_strdup(db, sep + 1);
}

wrapper = tal(db, struct db_sqlite3);
db->conn = wrapper;

err = sqlite3_open_v2(filename, &sql, flags, NULL);

if (err != SQLITE_OK) {
db_fatal("failed to open database %s: %s", filename,
sqlite3_errstr(err));
}
db->conn = sql;
wrapper->conn = sql;

if (!backup_filename) {
wrapper->backup_conn = NULL;
wrapper->backup = NULL;
} else {
err = sqlite3_open_v2(backup_filename,
&wrapper->backup_conn,
flags, NULL);
if (err != SQLITE_OK) {
db_fatal("failed to open backup database %s: %s",
backup_filename,
sqlite3_errstr(err));
}

wrapper->backup = sqlite3_backup_init(wrapper->backup_conn,
"main",
wrapper->conn,
"main");
if (!wrapper->backup) {
db_fatal("failed to setup backup on database %s: %s",
backup_filename,
sqlite3_errmsg(wrapper->backup_conn));
}

/* Initial copy. */
err = sqlite3_backup_step(wrapper->backup, -1);
if (err != SQLITE_DONE) {
db_fatal("Failed initial backup: %s",
sqlite3_errstr(err));
}
}

/* In case another process (litestream?) grabs a lock, we don't
* want to return SQLITE_BUSY immediately (which will cause a
* fatal error): give it 60 seconds.
* We *could* make this an option, but surely the user prefers a
* long timeout over an outright crash.
*/
sqlite3_busy_timeout(db->conn, 60000);
sqlite3_busy_timeout(conn2sql(db->conn), 60000);

sqlite3_prepare_v2(db->conn, "PRAGMA foreign_keys = ON;", -1, &stmt, NULL);
sqlite3_prepare_v2(conn2sql(db->conn),
"PRAGMA foreign_keys = ON;", -1, &stmt, NULL);
err = sqlite3_step(stmt);
sqlite3_finalize(stmt);
return err == SQLITE_DONE;
Expand All @@ -58,7 +125,7 @@ static bool db_sqlite3_setup(struct db *db)
static bool db_sqlite3_query(struct db_stmt *stmt)
{
sqlite3_stmt *s;
sqlite3 *conn = (sqlite3*)stmt->db->conn;
sqlite3 *conn = conn2sql(stmt->db->conn);
int err;

err = sqlite3_prepare_v2(conn, stmt->query->query, -1, &s, NULL);
Expand Down Expand Up @@ -110,7 +177,7 @@ static bool db_sqlite3_exec(struct db_stmt *stmt)
#if !HAVE_SQLITE3_EXPANDED_SQL
/* Register the tracing function if we don't have an explicit way of
* expanding the statement. */
sqlite3_trace(stmt->db->conn, trace_sqlite3, stmt);
sqlite3_trace(conn2sql(stmt->db->conn), trace_sqlite3, stmt);
#endif

if (!db_sqlite3_query(stmt)) {
Expand Down Expand Up @@ -140,7 +207,7 @@ static bool db_sqlite3_exec(struct db_stmt *stmt)
#if !HAVE_SQLITE3_EXPANDED_SQL
/* Unregister the trace callback to avoid it accessing the potentially
* stale pointer to stmt */
sqlite3_trace(stmt->db->conn, NULL, NULL);
sqlite3_trace(conn2sql(stmt->db->conn), NULL, NULL);
#endif

return success;
Expand All @@ -156,7 +223,8 @@ static bool db_sqlite3_begin_tx(struct db *db)
{
int err;
char *errmsg;
err = sqlite3_exec(db->conn, "BEGIN TRANSACTION;", NULL, NULL, &errmsg);
err = sqlite3_exec(conn2sql(db->conn),
"BEGIN TRANSACTION;", NULL, NULL, &errmsg);
if (err != SQLITE_OK) {
db->error = tal_fmt(db, "Failed to begin a transaction: %s", errmsg);
return false;
Expand All @@ -168,11 +236,35 @@ static bool db_sqlite3_commit_tx(struct db *db)
{
int err;
char *errmsg;
err = sqlite3_exec(db->conn, "COMMIT;", NULL, NULL, &errmsg);

struct db_sqlite3 *wrapper = (struct db_sqlite3 *) db->conn;

err = sqlite3_exec(conn2sql(db->conn),
"COMMIT;", NULL, NULL, &errmsg);
if (err != SQLITE_OK) {
db->error = tal_fmt(db, "Failed to commit a transaction: %s", errmsg);
return false;
}

if (wrapper->backup) {
/* This *should* be fast:
* https://sqlite.org/c3ref/backup_finish.html#sqlite3backupstep
* "If the source database is modified by using the same database
* connection as is used by the backup operation, then the backup
* database is automatically updated at the same time."
* So the `COMMIT;` should have updated the backup too, and the
* sqlite3_backup_step should return quickly.
*/
err = sqlite3_backup_step(wrapper->backup, -1);
if (err != SQLITE_DONE) {
db->error = tal_fmt(db,
"Failed to replicate transaction "
"to backup: %s",
sqlite3_errstr(err));
return false;
}
}

return true;
}

Expand Down Expand Up @@ -221,19 +313,26 @@ static void db_sqlite3_stmt_free(struct db_stmt *stmt)

static size_t db_sqlite3_count_changes(struct db_stmt *stmt)
{
sqlite3 *s = stmt->db->conn;
sqlite3 *s = conn2sql(stmt->db->conn);
return sqlite3_changes(s);
}

static void db_sqlite3_close(struct db *db)
{
sqlite3_close(db->conn);
db->conn = NULL;
struct db_sqlite3 *wrapper = (struct db_sqlite3 *) db->conn;

if (wrapper->backup) {
sqlite3_backup_finish(wrapper->backup);
sqlite3_close(wrapper->backup_conn);
}
sqlite3_close(wrapper->conn);

db->conn = tal_free(db->conn);
}

static u64 db_sqlite3_last_insert_id(struct db_stmt *stmt)
{
sqlite3 *s = stmt->db->conn;
sqlite3 *s = conn2sql(stmt->db->conn);
return sqlite3_last_insert_rowid(s);
}

Expand All @@ -242,12 +341,24 @@ static bool db_sqlite3_vacuum(struct db *db)
int err;
sqlite3_stmt *stmt;

sqlite3_prepare_v2(db->conn, "VACUUM;", -1, &stmt, NULL);
struct db_sqlite3 *wrapper = (struct db_sqlite3 *) db->conn;

sqlite3_prepare_v2(conn2sql(db->conn), "VACUUM;", -1, &stmt, NULL);
err = sqlite3_step(stmt);
if (err != SQLITE_DONE)
db->error = tal_fmt(db, "%s", sqlite3_errmsg(db->conn));
db->error = tal_fmt(db, "%s",
sqlite3_errmsg(conn2sql(db->conn)));
sqlite3_finalize(stmt);

if (err == SQLITE_DONE && wrapper->backup) {
err = sqlite3_backup_step(wrapper->backup, -1);
if (err != SQLITE_DONE)
db->error = tal_fmt(db,
"Failed to replicate VACUUM "
"to backup: %s",
sqlite3_errstr(err));
}

return err == SQLITE_DONE;
}

Expand Down

0 comments on commit b4198cc

Please sign in to comment.