Increase data sizes #138

Merged: dahankzter merged 7 commits into master from increase_data_sizes on Jun 17, 2019
Conversation

@dahankzter (Contributor) commented on Jun 13, 2019

This is an attempt to simply generate more data on disk. Tuples and UDTs have more fields, and string-like types are made significantly longer.

Fixes: #121
Fixes: #75

@dahankzter requested a review from penberg on June 13, 2019 10:50
The number of partition keys, clustering keys and regular columns
can now be configured by the user via CLI args.
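As an aside, a minimal sketch of how such limits might be exposed on the command line, assuming the standard library flag package; the flag names and defaults below are illustrative and may not match the project's actual CLI.

	package main

	import "flag"

	// Illustrative flags only; the real option names and defaults in the tool may differ.
	var (
		maxPartitionKeys  = flag.Int("max-partition-keys", 2, "maximum number of partition key columns per table")
		maxClusteringKeys = flag.Int("max-clustering-keys", 4, "maximum number of clustering key columns per table")
		maxColumns        = flag.Int("max-columns", 16, "maximum number of regular columns per table")
	)

	func main() {
		flag.Parse()
		// The parsed values would then be passed on to the schema generator, e.g.
		// GenSchema(cs, *maxPartitionKeys, *maxClusteringKeys, *maxColumns).
	}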
schema.go Outdated
 )

-func GenSchema(cs *CompactionStrategy) *Schema {
+func GenSchema(cs *CompactionStrategy, maxPartitionKeys, maxClusteringKeys, maxColumns int) *Schema {

Contributor:

We could wrap these configuration parameters into a SchemaConfig struct in a follow-up patch.

Contributor (Author):

Makes sense. While it opens up significant refactoring possibilities, it also allows the developer to "forget to apply" a certain config, but that can happen with an argument as well, I guess.

Contributor (Author):

I can do it now. It's a quick fix and makes it much nicer.

Contributor (Author):

SchemaConfig added.
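For reference, a rough sketch of what such a wrapper could look like; the field names below are illustrative and may not match the merged patch, while CompactionStrategy and Schema are the project's existing types.

	// Illustrative only: group the generation limits into a single struct so the
	// generator takes one argument instead of a growing parameter list.
	type SchemaConfig struct {
		CompactionStrategy *CompactionStrategy // compaction strategy for generated tables
		MaxPartitionKeys   int                 // upper bound on partition key columns
		MaxClusteringKeys  int                 // upper bound on clustering key columns
		MaxColumns         int                 // upper bound on regular columns
	}

	// The generator signature could then shrink back to a single parameter:
	//
	//	func GenSchema(sc SchemaConfig) *Schema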

schema.go Outdated
	}
	var mvs []MaterializedView
	numMvs := 1
	for i := 0; i < numMvs; i++ {
		col, err := validMVColumn()
		if err != nil {
			fmt.Println(err)

Contributor:

Perhaps something more structured for error reporting?

@dahankzter (Author) commented Jun 17, 2019:

I don't know what to do in this case though. It simply means that we didn't generate a proper column that can take part in the MV.

Contributor (Author):

Better message added.
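For illustration, one way the loop body above could report the failure with more context than a bare fmt.Println(err); this is only a sketch, and the merged change simply improved the printed message.

	// Sketch: say which materialized view is being skipped and why, then move on.
	col, err := validMVColumn()
	if err != nil {
		fmt.Printf("skipping materialized view %d: no suitable column found: %v\n", i, err)
		continue
	}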

@@ -39,7 +39,10 @@ const (
	TYPE_VARCHAR = SimpleType("varchar")
	TYPE_VARINT  = SimpleType("varint")

	MaxUDTParts     = 10
	MaxStringLength = 1000

Contributor:

This limits blob maximum size too, no? We should allow much larger blobs (for example, 1 MB or even 10 MB).

Contributor (Author):

Ah yes, I can add a separate one for blobs.

Contributor (Author):

It's 1e6 bytes now; perhaps enough to start with?
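A sketch of how a separate blob limit could sit alongside the existing constants; the name MaxBlobLength is an assumption here, and only the 1e6-byte figure comes from the comment above.

	const (
		MaxUDTParts     = 10
		MaxStringLength = 1000
		// MaxBlobLength is a hypothetical name for the separate blob limit
		// discussed above; 1000000 bytes (1e6, roughly 1 MB) as a starting point.
		MaxBlobLength = 1000000
	)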

@penberg (Contributor) commented Jun 17, 2019:

Looks good, please merge.

@dahankzter merged commit fad33c4 into master on Jun 17, 2019
@penberg deleted the increase_data_sizes branch on December 11, 2019 07:06
Successfully merging this pull request may close these issues:

Low rate of db population of 1TB dataset
Column count configuration