DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec #1688

NoahStapp · 2024-10-28T16:08:19Z

Python implementation: mongodb/mongo-python-driver#1976

The new prose test only verifies that drivers correctly implement prepending _id for insert_one. Is it worthwhile to verify every other insert operation (insert_many, bulk_write, client_bulk_write) as well?

… spec

source/crud/tests/README.md

source/crud/crud.md

jmikola · 2024-10-28T18:14:22Z

source/crud/crud.md

@@ -815,6 +815,12 @@ database-level aggregation will allow users to receive a cursor from these colle

 ##### Insert, Update, Replace, Delete, and Bulk Writes

+###### Generated identifiers
+
+The insert and bulk insert operations described below MUST generate identifiers for all documents that do not already


Noted that this is already discussed in the client bulk write spec.

Having this behavior discussed here for general CRUD operations makes sense to me, as the client bulk write spec follows from this broader context.

The client bulk write spec stands on its own, except for a reference back to CRUD for modeling unacknowledged write results.

With my last comment, I just meant to acknowledge that no changes were needed to the client bulk write spec since it already addressed _id ordering. This was in the vein of previous PRs like #1644 that required updates to both specs.

jmikola · 2024-10-28T18:16:53Z

source/crud/tests/README.md

+
+Construct a `MongoClient` (referred to as `client`) with
+[command monitoring](../../command-logging-and-monitoring/command-logging-and-monitoring.md) enabled to observe
+CommandStartedEvents. Perform an `insert` command using `client` and assert that one CommandStartedEvent (referred to as


Perform an insert command

This is ambiguous, as the prose test could be implementing using the generic command runner, in which case I'd expect no ID generation. I'd suggest changing this to explicitly suggesting running an insertOne operation. You can extend that with insertMany and a bulkWrite if you want full test coverage.

source/crud/tests/README.md

jmikola · 2024-10-28T21:21:14Z

source/crud/tests/README.md

+CommandStartedEvents. For each of `insertOne`, client `bulkWrite`, and collection `bulkWrite`, do the following:
+
+- Execute the command with a document that does not contain an `_id` field.
+- If possible, capture the wire protocol message (referred to as `request`) of the command.


Is this only necessary in languages where the ordering is still not deterministic in the CommandStartedEvent's command document?

This is the preferred way to verify the ordering of the document, as drivers may modify the document between emitting the CommandStartedEvent and the actual wire transfer of the command. For example, the Python driver re-orders _id to be the first field during BSON conversion, which takes place after the CommandStartedEvent would be emitted.

Verifying the order of the actual transmitted payload document ensures that the server receives exactly what we expect it to.

I think it could potentially be rephrased if wire protocol parsing is difficult for drivers to achieve, but I agree it would be easy to assert _id is the first key in command monitoring and that not actually be the case when the wire message is produced.

I think perhaps for this test; we can encourage drivers to just use command monitoring as the "main path" set of assertions to implement. But we should encourage drivers to check that their wire message key order either matches their command events or that reordering is done when the wire message is built.

So for Node, I would write a test that asserts the JS object that is inspectable on the event (for any command) matches key order in BSON. Python may check that their wire messages are ordered correctly.

If drivers are able to check wire messages directly, they should take that path. Otherwise, they should fall back to using command monitoring, ideally verifying that they don't modify field ordering before sending data over the wire.

I agree, after all the point here is that the bytes are in an order that the server benefits from, if your command monitoring is a source of truth for such a thing you could/should use it but the real goal is in the wire message.

source/crud/tests/README.md

source/crud/crud.md

source/crud/tests/README.md

nbbeeken · 2024-10-29T17:38:50Z

source/crud/tests/README.md

+CommandStartedEvents. For each of `insertOne`, client `bulkWrite`, and collection `bulkWrite`, do the following:
+
+- Execute the command with a document that does not contain an `_id` field.
+- If possible, capture the wire protocol message (referred to as `request`) of the command.


I think it could potentially be rephrased if wire protocol parsing is difficult for drivers to achieve, but I agree it would be easy to assert _id is the first key in command monitoring and that not actually be the case when the wire message is produced.

I think perhaps for this test; we can encourage drivers to just use command monitoring as the "main path" set of assertions to implement. But we should encourage drivers to check that their wire message key order either matches their command events or that reordering is done when the wire message is built.

So for Node, I would write a test that asserts the JS object that is inspectable on the event (for any command) matches key order in BSON. Python may check that their wire messages are ordered correctly.

jmikola

LGTM w/ or w/o my suggestion.

source/crud/tests/README.md

Co-authored-by: Jeremy Mikola <[email protected]>

DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD…

2955d19

… spec

NoahStapp requested a review from a team as a code owner October 28, 2024 16:08

NoahStapp requested review from nbbeeken and removed request for a team October 28, 2024 16:08

NoahStapp changed the title ~~DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD…~~ DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec Oct 28, 2024

Merge branch 'master' into DRIVERS-1408

925cd95

nbbeeken requested changes Oct 28, 2024

View reviewed changes

source/crud/tests/README.md Outdated Show resolved Hide resolved

nbbeeken reviewed Oct 28, 2024

View reviewed changes

source/crud/crud.md Outdated Show resolved Hide resolved

nbbeeken mentioned this pull request Oct 28, 2024

perf(NODE-6466): make _id appear first in document mongodb/node-mongodb-native#4295

Draft

5 tasks

jmikola reviewed Oct 28, 2024

View reviewed changes

Update prose test

1afe0c0

NoahStapp requested a review from a team as a code owner October 28, 2024 20:16

NoahStapp requested review from durran, nbbeeken and jmikola and removed request for a team October 28, 2024 20:16

jmikola reviewed Oct 28, 2024

View reviewed changes

alcaeus reviewed Oct 29, 2024

View reviewed changes

source/crud/tests/README.md Show resolved Hide resolved

nbbeeken requested changes Oct 29, 2024

View reviewed changes

Clarify user-supplied ID fields

48ce014

NoahStapp requested review from alcaeus, nbbeeken and jmikola October 30, 2024 13:35

nbbeeken approved these changes Oct 30, 2024

View reviewed changes

alcaeus removed their request for review October 31, 2024 09:03

NoahStapp removed the request for review from durran October 31, 2024 13:30

jmikola approved these changes Oct 31, 2024

View reviewed changes

source/crud/tests/README.md Outdated Show resolved Hide resolved

NoahStapp and others added 2 commits October 31, 2024 11:52

Update source/crud/tests/README.md

e57594a

Co-authored-by: Jeremy Mikola <[email protected]>

Merge branch 'master' into DRIVERS-1408

549ef56

NoahStapp merged commit b607a57 into mongodb:master Oct 31, 2024
4 of 5 checks passed

NoahStapp deleted the DRIVERS-1408 branch October 31, 2024 16:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec #1688

DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec #1688

NoahStapp commented Oct 28, 2024

jmikola Oct 28, 2024

NoahStapp Oct 28, 2024

jmikola Oct 28, 2024

jmikola Oct 28, 2024

jmikola Oct 28, 2024

NoahStapp Oct 29, 2024 •

edited

Loading

nbbeeken Oct 29, 2024

NoahStapp Oct 29, 2024

nbbeeken Oct 29, 2024

nbbeeken Oct 29, 2024

jmikola left a comment

DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec #1688

DRIVERS-1408 - Add guidance on adding _id fields to documents to CRUD spec #1688

Conversation

NoahStapp commented Oct 28, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

NoahStapp Oct 29, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmikola left a comment

Choose a reason for hiding this comment

NoahStapp Oct 29, 2024 •

edited

Loading