Add support for UUID version 7 #19

nevans · 2023-06-29T22:48:52Z

Although the specification for UUIDv7 is still in draft, the UUIDv7 algorithm has been relatively stable as it progresses to completion.

Version 7 UUIDs can be very useful because they are lexicographically sortable, which can improve, e.g., database index locality. See section 6.10 of the draft specification for further explanation:

https://datatracker.ietf.org/doc/draft-ietf-uuidrev-rfc4122bis/
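To illustrate the sortability claim, here is a minimal sketch of the version 7 layout with a millisecond timestamp only: 48 timestamp bits, a 4-bit version, 12 random bits, a 2-bit variant, and 62 more random bits. The helper name `sketch_uuid_v7` and its parameters are hypothetical and chosen for illustration; this is not the code in this PR.

```ruby
require "securerandom"

# Hypothetical helper (not the PR's implementation): packs a millisecond
# timestamp and 74 random bits into the UUIDv7 layout.
def sketch_uuid_v7(unix_ms, rand74)
  rand_a = (rand74 >> 62) & 0xFFF          # 12 random bits after the version
  rand_b = rand74 & ((1 << 62) - 1)        # 62 random bits after the variant
  value  = ((unix_ms & 0xFFFF_FFFF_FFFF) << 80) | # 48-bit Unix time in ms
           (0x7 << 76) |                          # version 7
           (rand_a << 64) |
           (0b10 << 62) |                         # variant bits "10"
           rand_b
  hex = "%032x" % value
  [hex[0, 8], hex[8, 4], hex[12, 4], hex[16, 4], hex[20, 12]].join("-")
end

now_ms = Process.clock_gettime(Process::CLOCK_REALTIME, :millisecond)
puts sketch_uuid_v7(now_ms, SecureRandom.random_number(1 << 74))
```

Because the timestamp occupies the most significant bits, a UUID generated in a later millisecond compares greater as a plain string than one generated earlier, whatever the random bits are. Within the same millisecond the order is random.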

The specification allows up to 12 bits of extra timestamp precision, to make UUID generation closer to monotonically increasing. This provides between 1ms and ~240ns of timestamp precision. At the cost of some code complexity and a small performance penalty, a kwarg may specify any arbitrary precision between 0 and 12 extra bits. Any stronger guarantees of monotonicity have considerably larger tradeoffs, so nothing more is implemented. This limitation is documented.
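The precision figures above follow from dividing one millisecond into 2^n steps, where n is the number of extra timestamp bits. The sketch below works through that arithmetic and shows one way the sub-millisecond fraction could be scaled into those bits. The names `tick_ns` and `sub_ms_fraction` are hypothetical, and the scaling is an assumption based on the draft's description of increased clock precision, not a copy of the PR's code.

```ruby
MS_IN_NS = 1_000_000 # nanoseconds per millisecond

# Width of one timestamp step, in nanoseconds, when `extra_bits` bits
# subdivide each millisecond.
def tick_ns(extra_bits)
  raise ArgumentError, "extra_bits must be in 0..12" unless (0..12).cover?(extra_bits)
  MS_IN_NS.to_f / (1 << extra_bits)
end

# Assumed encoding: scale the sub-millisecond part of a nanosecond timestamp
# into an integer that fits in `extra_bits` bits.
def sub_ms_fraction(unix_ns, extra_bits)
  raise ArgumentError, "extra_bits must be in 0..12" unless (0..12).cover?(extra_bits)
  (unix_ns % MS_IN_NS) * (1 << extra_bits) / MS_IN_NS
end

[0, 4, 8, 12].each do |bits|
  puts "#{bits} extra bits: one step is about #{tick_ns(bits).round} ns"
end
```

With 0 extra bits a step is the full millisecond; with 12 extra bits it is 1,000,000 / 4096 ns, which is where the figure of roughly 240 ns comes from.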

Ruby issue: https://bugs.ruby-lang.org/issues/19735

nevans · 2023-06-29T22:52:35Z

FYI: see an earlier PR here: #15. I wasn't aware of that other PR when I first started on mine. The primary distinction from that PR is that this allows up to 12 extra bits of timestamp precision, at the cost of code complexity.

nevans · 2023-07-03T12:05:48Z

lib/random/formatter.rb

+  # {Section 6.2}[https://www.ietf.org/archive/id/draft-ietf-uuidrev-rfc4122bis-07.html#monotonicity_counters]
+  # of the specification.
+  #
+  def uuid_v7(extra_timestamp_bits: 0)


Any suggestions for a better keyword parameter name?

e12e · 2023-07-24T10:15:16Z

lib/random/formatter.rb

+        rand.unpack("H4H12").join("-")
+      ]
+
+    when (0..12) # the generic version is slower than the special cases above


Would this be clearer as when (1..11)?

I don't know. :) I can go either way:

On the one hand, because 0 and 12 are handled by the prior clauses, it makes sense to only show the numbers that will be handled by this clause.

On the other hand, since this is the generic version, it can handle 0..12, and it's nice to "document" that here.

Although I did consider this earlier, I think I left it as 0..12 mostly by accident: While I was testing and benchmarking the code, I would copy/paste the entire method and then comment out or delete one of the other clauses. So it was temporarily simpler to keep this as 0..12.

So, what do you think?
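For readers following the question: Ruby's `case` evaluates `when` clauses top to bottom and stops at the first match, so a trailing `when (0..12)` only ever sees 1..11 once 0 and 12 have their own clauses. A minimal sketch of that behavior, with a hypothetical method name and placeholder symbols standing in for the real clause bodies:

```ruby
# Hypothetical stand-in for the dispatch under discussion; the symbols are
# placeholders, not the PR's clause bodies.
def clause_for(extra_timestamp_bits)
  case extra_timestamp_bits
  when 0       then :special_case_0   # matched first, never reaches the range
  when 12      then :special_case_12  # likewise
  when (0..12) then :generic          # in practice only handles 1..11
  else raise ArgumentError, "extra_timestamp_bits must be in 0..12"
  end
end

p clause_for(0)
p clause_for(5)
p clause_for(12)
```

Either spelling of the range gives the same results here; the choice is about what the range documents, not about behavior.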

e12e · 2023-07-24T10:16:04Z

Any updates on this or the sibling PR for including UUIDv7 support?

unak

LGTM

mame · 2023-09-14T10:40:40Z

lib/random/formatter.rb

+  #
+  # See draft-ietf-uuidrev-rfc4122bis[https://datatracker.ietf.org/doc/draft-ietf-uuidrev-rfc4122bis/]
+  # for details of UUIDv7.
+  #


@nevans It would be good to note that fixing the random number seed (e.g., by Kernel#srand) does not make this method reproducible.

Good point. I'll add a note for that. If I remember correctly, I had originally written a version that allowed a keyword argument for the timestamp. But when I changed the code to allow extra_timestamp_bits, it didn't seem like it was worth the complexity.

What do you think about the following:

Suggested change

-  #
+  #
+  # Note that this method cannot be made reproducible with Kernel#srand, which
+  # can only affect the random bits.  The sorted bits will still be based on
+  # Process.clock_gettime.
+  #

Looks good, thanks!

@nevans Ah, there are some other ways to fix the random seed than Kernel#srand, such as: Random.new(0).uuid_v7.

How about this? This PR is already merged, so I will create another PR if you are OK.

Note that this method cannot be made reproducible because its output includes not only random bits but also a timestamp.

@mame Looks good. I like your text better than mine. It's simpler, clearer, and covers the Random.new(0) scenario too. :)
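The point about seeding can be seen in a small sketch: two generators created with the same seed yield identical random bytes, but an identifier that also embeds the clock still differs between calls made at different times. The helper `timestamped_id` is a hypothetical stand-in with a simplified format, not `uuid_v7` itself.

```ruby
# Hypothetical stand-in for a timestamp-prefixed identifier: 48-bit
# millisecond timestamp in hex, then 10 random bytes in hex.
def timestamped_id(rng, unix_ms)
  "%012x-%s" % [unix_ms, rng.bytes(10).unpack1("H*")]
end

first  = timestamped_id(Random.new(0), 1_700_000_000_000)
second = timestamped_id(Random.new(0), 1_700_000_000_001) # same seed, 1 ms later

puts first
puts second
puts first == second                                 # false: timestamps differ
puts first.split("-").last == second.split("-").last # true: same seeded bytes
```

The same reasoning covers both Kernel#srand and an explicitly seeded instance such as Random.new(0): seeding controls only the random portion.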

hsbt · 2023-09-15T01:21:52Z

@nevans Thank you for submitting this.

Can you confirm #19 (comment)? I'll merge this after that.

nevans · 2023-09-16T17:55:02Z

> @nevans Thank you for submitting this.
>
> Can you confirm #19 (comment)? I'll merge this after that.

I pushed a new version with the change I suggested here: #19 (comment). Is that good?

Thanks, @mame and @hsbt. 🙂

nevans · 2023-09-18T21:09:04Z

@hsbt I addressed @rhenium's comment. I also hadn't noticed that the build was failing on Ruby 2.6 (the tests were using Time.at ts, in: "Z"), so I just pushed an update that fixes that.

nevans mentioned this pull request Jun 29, 2023

Add UUID v7 support #15

Closed

nevans mentioned this pull request Jun 29, 2023

Add support for UUID version 7 ruby/ruby#7953

Closed

nevans force-pushed the uuid_v7 branch 3 times, most recently from 9972161 to 63d18fd Compare July 3, 2023 14:03

e12e reviewed Jul 24, 2023

unak approved these changes Sep 14, 2023

mame reviewed Sep 14, 2023

nevans force-pushed the uuid_v7 branch from 63d18fd to 57dbabb Compare September 16, 2023 17:53

rhenium reviewed Sep 16, 2023

nevans force-pushed the uuid_v7 branch from 57dbabb to 165dac1 Compare September 18, 2023 21:01

nevans force-pushed the uuid_v7 branch from 165dac1 to 34ed1a2 Compare September 18, 2023 21:07

hsbt merged commit 61cb29e into ruby:master Sep 19, 2023

nevans deleted the uuid_v7 branch September 19, 2023 12:45

kachick mentioned this pull request Sep 26, 2023

UUIDv6, UUIDv7, UUIDv8 kachick/ruby-ulid#37

Closed
