Simple persistence #9

jgraeger · 2022-06-04T17:16:25Z

Fixes #7

I fucked up a local rebase and realized to late, thats why i confused you all with multiple (now closed pull requests; #1, #8). Sorry for that. This PR is now the real deal ☺️

jgraeger · 2022-06-04T18:33:27Z

PR is ready for review. Maybe, now that the services actually persists data in an append only manner, we should possibly rename it to Kodak

ldb

While I like the idea of the append-only log, in this implementation I see significant risks to performance once the user base scales up. Not that it matters too much at this stage, but we will have to benchmark our whole system towards the end of the project and I feel like the complexity is rather high for this tradeoff. This service in particular needs to be very robust with the data it's handling, so I'd like to see at least a couple more tests and a benchmark or two before I'd feel comfortable green lighting this, sorry 😅. Kill Once Destroy All, after all.

ldb · 2022-06-04T18:48:35Z

logstore/errors.go

+	return fmt.Sprintf("no value for key: %v", e.key)
+}
+
+func IsNotFoundError(err error) bool {


Why make it this complex? Wouldn't it be enough to only expose something like

var ErrNotFound = errors.New("key not found")

and let the caller identify the error using errors.As and errors.Is.

ldb · 2022-06-04T18:49:55Z

logstore/record.go

+)
+
+const (
+	kindValue = iota


It would be good to have the default value be a special case like kindEmpty to make identifying uninitialised records easy.

ldb · 2022-06-04T18:50:17Z

logstore/record.go

+
+// ErrInsufficientData is returned when the given data is not enouch to be
+// parsed into a Record
+var ErrInsufficientData = errors.New("insufficient bytes to parse a record")


Is this an error that would ever be exposed to the caller?

ldb · 2022-06-04T18:53:37Z

logstore/record.go

+	readBuf := bytes.NewBuffer(data)
+
+	checksum := uint32(binary.BigEndian.Uint32(readBuf.Next(checksumSize)))
+	kind, _ := readBuf.ReadByte()


Unchecked error

ldb · 2022-06-04T18:55:22Z

logstore/record.go

+		return nil, ErrCorruptData
+	}
+
+	return &Record{


Why return a pointer here?

ldb · 2022-06-04T19:00:40Z

logstore/scanner.go

@@ -0,0 +1,46 @@
+package logstore


No tests for the scanner? :(

ldb · 2022-06-04T19:14:30Z

logstore/store.go

+}
+
+func (s *Store) Get(key string) ([]byte, error) {
+	f, err := os.Open(s.storagePath)


Each call to Get opens the file anew and creates a new scanner.

I am thinking that it would be much more efficient to implement the store using a sync.Pool of custom scanners that use an io.ReadSeeker (an os.File is an io.Seeker).

The problem is generally that right now Get is O(n) instead of O(1), which scales .. not great 😅

ldb · 2022-06-04T19:15:21Z

logstore/store.go

+	}
+	defer f.Close()
+
+	scanner, err := NewScanner(f, s.maxRecordSize)


This is a huge allocation for every request that enters our system

ldb · 2022-06-04T19:35:54Z

logstore/store_test.go

+}
+
+func TestStore(t *testing.T) {
+	dir, err := initWorkdir()


Again, would be good to have some more tests, especially around records with long values, and a high number of records

ldb · 2022-06-04T19:40:21Z

main.go

 }

 func main() {
 	flag.Parse()

+	workdir, err := os.Getwd()


The database file should not be stored in the working directory, as this has to be mounted in externally. You would have to change directory there first, after mounting it. Instead, I'd go for a well known path, like /data/db (what MongoDB uses).

ldb · 2022-06-04T19:55:40Z

I mean, you are very obviously aware of the issue (after reading #10 ).

I am thinking do we really need to store every record change as a new record? You mentioned compliance as a reason to know when a user was disabled for example, but we could simply store all these things in singular records. I feel like a simple hash map in memory that gets flushed to disk on every change would scale much better than this and be much less risky.

ldb · 2022-06-08T07:36:07Z

This was tested on dev @jgraeger are you fine with merging this just so to get it out of the review queue?

jgraeger · 2022-06-08T07:42:42Z

Sure. Merge #13 and close this PR as it's basically rejected in the current state. I will open a new PR for the next iteration as discussed when it's ready.

ldb · 2022-06-11T11:39:05Z

Closing as this was fixed by #13.
Not deleting to serve as future reference

jgraeger and others added 9 commits June 4, 2022 19:12

Records

1775e4a

Testing record serialization

7892810

Temporary store commit

00aa850

Scanner for logfile

4482520

Naive database implementation

cd20a93

Tests for store

bce0ed2

Easier handling for exported DB errors

3436138

Provide example request in the repo

0b928b7

Getting Koda persisting

e3da92d

jgraeger changed the title ~~Implements simple persistence~~ Simple persistence Jun 4, 2022

jgraeger mentioned this pull request Jun 4, 2022

Better persistence #10

Open

2 tasks

jgraeger marked this pull request as ready for review June 4, 2022 18:32

jgraeger requested review from ldb and akkbng June 4, 2022 18:33

jgraeger mentioned this pull request Jun 4, 2022

Deploy Koda with persistence mindtastic/deployments#41

Closed

2 tasks

ldb requested changes Jun 4, 2022

View reviewed changes

ldb mentioned this pull request Jun 6, 2022

Implement LocalFileStore as small persistence for the time being (and some other stuff) #13

Merged

ldb closed this Jun 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple persistence #9

Simple persistence #9

jgraeger commented Jun 4, 2022 •

edited

Loading

jgraeger commented Jun 4, 2022 •

edited

Loading

ldb left a comment •

edited

Loading

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb Jun 4, 2022

ldb commented Jun 4, 2022

ldb commented Jun 8, 2022

jgraeger commented Jun 8, 2022

ldb commented Jun 11, 2022

Simple persistence #9

Simple persistence #9

Conversation

jgraeger commented Jun 4, 2022 • edited Loading

jgraeger commented Jun 4, 2022 • edited Loading

ldb left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ldb commented Jun 4, 2022

ldb commented Jun 8, 2022

jgraeger commented Jun 8, 2022

ldb commented Jun 11, 2022

jgraeger commented Jun 4, 2022 •

edited

Loading

jgraeger commented Jun 4, 2022 •

edited

Loading

ldb left a comment •

edited

Loading