Optimize out O(n) queries when updating ManyRelatedField #4917

orf · 2017-02-21T14:40:51Z

Hi,
I was surprised to find that when updating a m2m relation DRF makes O(n) queries when updating ManyRelatedField instances due to this loop: https://github.com/tomchristie/django-rest-framework/blob/master/rest_framework/relations.py#L489-L498

I think it was designed this way to throw a reasonable error if a related PK does not exist, and to be as extensible as possible.

However I think the most common case of using a ManyRelatedField isn't served well by this and could be improved. The most common case would be just a ManyRelatedField with the related objects pk as the value, and the current implementation will make N queries for the models.

And to override this you would need to create a custom field and override the to_internal_value function to fetch it.

For this common case wouldn't something like this implementation be suitable, and couldn't something like this be included in DRF?

       def to_internal_value(self, data):
           if isinstance(data, type('')) or not hasattr(data, '__iter__'):
                self.fail('not_a_list', input_type=type(data).__name__)
           if not self.allow_empty and len(data) == 0:
                self.fail('empty')

           values = list(self.child_relation.get_queryset.filter(pk__in=data))

           if len(values) != len(data):
               missing_primary_keys = set(v.pk for v in values) - set(data)
               self.fail('missing_ids', ids_not_found=list(missing_primary_keys))

           return values

Just use pk__in to fetch all the models in a single query, then compare the length of the result with the length of the given data. If they don't match then we are missing a record and we can use a set operator to find out which ones and raise a validation error with the missing primary keys?

The text was updated successfully, but these errors were encountered:

tomchristie · 2017-04-24T19:22:37Z

Couldn't they be the same length but different values?

orf · 2017-04-24T19:45:33Z

Yeah, good point, I guess you should just use the set difference from the get-go. It's more about using '__in' instead of repeated single queries though.

E.g:

           values = list(self.child_relation.get_queryset.filter(pk__in=data))
           missing_primary_keys = set(v.pk for v in values) - set(data)
           if missing_primary_keys:
               self.fail('missing_ids', ids_not_found=list(missing_primary_keys))

Or something to that effect. Perhaps I'm missing the point and there is a good reason for the repeated queries though?

tomchristie · 2017-04-24T19:50:21Z

It's probably worth throwing up a pull request with this suggestion. If it doesn't change any API and it doesn't break any tests then it sounds like we'd be good. 👍

orf · 2017-04-24T19:54:03Z

I can give it a go and see if anything fails, but I'm not sure how to handle the self.child_relation.to_internal_value call in the original code. That is missing and that would likely break something, somewhere.

orf · 2017-04-24T20:38:56Z

Well, I got all the tests to pass by using an isinstance check for PrimaryKeyRelatedField. Not sure if this is the best way, using the pk_optimization stuff didn't seem to work.

ref encode#4917.

carltongibson · 2017-07-10T19:29:58Z

Closing as a known limitation in line with #5150. (Full featured PRs welcomed.)

orf mentioned this issue Apr 24, 2017

#4917 - Remove O(n) queries in m2m updates #5093

Closed

carltongibson added a commit to carltongibson/django-rest-framework that referenced this issue May 17, 2017

Proof of Concept for single lookup on HyperlinkRelatedField

083e107

ref encode#4917.

carltongibson added a commit to carltongibson/django-rest-framework that referenced this issue May 17, 2017

Proof of Concept for single lookup on HyperlinkRelatedField

160e203

ref encode#4917.

carltongibson mentioned this issue May 17, 2017

Proof of Concept for single lookup on HyperlinkRelatedField #5150

Closed

carltongibson added a commit to carltongibson/django-rest-framework that referenced this issue May 17, 2017

Proof of Concept for single lookup on HyperlinkRelatedField

e811365

ref encode#4917.

carltongibson closed this as completed Jul 10, 2017

jonasN5 mentioned this issue Mar 7, 2024

Optimize M2M Field #9276

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize out O(n) queries when updating ManyRelatedField #4917

Optimize out O(n) queries when updating ManyRelatedField #4917

orf commented Feb 21, 2017 •

edited

Loading

tomchristie commented Apr 24, 2017

orf commented Apr 24, 2017 •

edited

Loading

tomchristie commented Apr 24, 2017

orf commented Apr 24, 2017 •

edited

Loading

orf commented Apr 24, 2017

carltongibson commented Jul 10, 2017

Optimize out O(n) queries when updating ManyRelatedField #4917

Optimize out O(n) queries when updating ManyRelatedField #4917

Comments

orf commented Feb 21, 2017 • edited Loading

tomchristie commented Apr 24, 2017

orf commented Apr 24, 2017 • edited Loading

tomchristie commented Apr 24, 2017

orf commented Apr 24, 2017 • edited Loading

orf commented Apr 24, 2017

carltongibson commented Jul 10, 2017

orf commented Feb 21, 2017 •

edited

Loading

orf commented Apr 24, 2017 •

edited

Loading

orf commented Apr 24, 2017 •

edited

Loading