AO3-6328 Try to fix large tags not updating in uses sort #4637

weeklies · 2023-10-07T19:38:25Z

Pull Request Checklist

Have you read "How to write the perfect pull request"?
Have you read the contributing guidelines?
Have you added tests for any changed functionality?
Have you added the Jira issue number
as the first thing in your pull request title (e.g. AO3-1234 Fix thing)
Do you have fewer than 5 pull requests already open? If not, please wait
until they are reviewed and merged before creating new pull requests.

Issue

https://otwarchive.atlassian.net/browse/AO3-6328 (Second PR)

Purpose

Tries to ensure that when the number of uses of a large tag increases (e.g. when a work is posted), a reindex is triggered. After some investigation, it looks like the cached tag count prevents write_taggings_to_redis from being called when the tag has at least 1000 uses (TAGGINGS_COUNT_MIN_CACHE_COUNT).

I am not 100% sure of this approach, which is to remove the old cached count in after_update. Another way I believe to be possible is setting the taggings_count_cache range in ScheduledTagJob to be a lower value like TAGGINGS_COUNT_MIN_CACHE_COUNT. Not sure of the broader implications of these methods.

Testing Instructions

See Jira issue.

References

#4632

Credit

weeklies (she/her)

ceithir · 2023-10-09T19:20:15Z

otwarchive/app/models/tag.rb

Lines 37 to 46 in 660a67a

    
           def self.taggings_count_expiry(count) 
        
             # What we are trying to do here is work out a resonable amount of time for a work to be cached for 
        
             # This should take the number of taggings and divide it by TAGGINGS_COUNT_CACHE_DIVISOR  ( defaults to 1500 ) 
        
             # such that for example 1500, would be naturally be tagged for one minute while 105,000 would be cached for 
        
             # 70 minutes. However we then apply a filter such that the minimum amount of time we will cache something for 
        
             # would be TAGGINGS_COUNT_MIN_TIME ( defaults to 3 minutes ) and the maximum amount of time would be 
        
             # TAGGINGS_COUNT_MAX_TIME ( defaulting to an hour ). 
        
             expiry_time = count / (ArchiveConfig.TAGGINGS_COUNT_CACHE_DIVISOR || 1500) 
        
             [[expiry_time, (ArchiveConfig.TAGGINGS_COUNT_MIN_TIME || 3)].max, (ArchiveConfig.TAGGINGS_COUNT_MAX_TIME || 50) + count % 20 ].min 
        
           end

Do we know what are the values of TAGGINGS_COUNT_CACHE_DIVISOR, TAGGINGS_COUNT_MIN_TIME and TAGGINGS_COUNT_MAX_TIME on staging? Just to check they were not set to extremes quite different from the defaults to test things, ending up with an abnormally high taggings_count_expiry.

sarken · 2023-10-09T23:31:43Z

@ceithir I just checked and all three are nil on staging.

ceithir · 2023-10-10T16:37:59Z

I don't think it's the sole source of the current issue, as it only affects tags with a count < 1000, but dropping it here to not forget:
update_tag_cache tries to update the cache if the cached value is less than TAGGINGS_COUNT_MIN_CACHE_COUNT... But it seems to be doing nothing in that case, as there's an early return for any non nil value of the cache, be it 1, 999 or 1001.

ceithir · 2023-10-11T13:05:19Z

Out of the blue, I'm wondering if the call to write_taggings_to_redis, and the whole job that goes with it, couldn't be replaced with something straightforward like:

if value != taggings_count_cache
  # Skipping callback to avoid changing the current in memory object, and cascading side effects in general, as we're in a getter
  Tag.where(id: self.id).update_all(taggings_count_cache: value)
  self.enqueue_to_index
end

What lock/performance issue am I missing here?

weeklies · 2023-10-12T16:03:26Z

@ceithir I didn't want to try to redesign the tags update code, because I worry about unintended consequences considering the complexity of the various async jobs. If AD&T decides to do so, I think it should be in a separate issue.

…)" This reverts commit 4cacf7b.

) Revert "AO3-6328 Try to fix large tags not updating in uses sort (#4637)" This reverts commit 4cacf7b.

weeklies added 4 commits October 7, 2023 19:38

AO3-6328 Fix large tags (cached tags) not updating in uses sort

31bfae5

Style

d05cf55

AO3-6328-spr Hound

e107ade

AO3-6328-spr Comment tweak

1d43366

github-actions bot added the Awaiting Review label Oct 7, 2023

AO3-6328-spr Hound?

17711c4

sarken added the Priority: High - Broken on Test Merge immediately after approval label Oct 8, 2023

brianjaustin added this to the 0.9.351 milestone Oct 16, 2023

sarken approved these changes Oct 16, 2023

View reviewed changes

sarken added Reviewed: Ready to Merge and removed Awaiting Review labels Oct 16, 2023

sarken merged commit 4cacf7b into otwcode:master Oct 16, 2023
24 checks passed

sarken added a commit that referenced this pull request Oct 17, 2023

Revert "AO3-6328 Try to fix large tags not updating in uses sort (#4637…

0aeda01

…)" This reverts commit 4cacf7b.

sarken mentioned this pull request Oct 17, 2023

Revert "AO3-6328 Try to fix large tags not updating in uses sort" #4641

Merged

sarken added a commit that referenced this pull request Oct 17, 2023

Revert "AO3-6328 Try to fix large tags not updating in uses sort" (#4641

848b4b8

) Revert "AO3-6328 Try to fix large tags not updating in uses sort (#4637)" This reverts commit 4cacf7b.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AO3-6328 Try to fix large tags not updating in uses sort #4637

AO3-6328 Try to fix large tags not updating in uses sort #4637

weeklies commented Oct 7, 2023

ceithir commented Oct 9, 2023 •

edited

Loading

sarken commented Oct 9, 2023

ceithir commented Oct 10, 2023

ceithir commented Oct 11, 2023

weeklies commented Oct 12, 2023

AO3-6328 Try to fix large tags not updating in uses sort #4637

AO3-6328 Try to fix large tags not updating in uses sort #4637

Conversation

weeklies commented Oct 7, 2023

Pull Request Checklist

Issue

Purpose

Testing Instructions

References

Credit

ceithir commented Oct 9, 2023 • edited Loading

sarken commented Oct 9, 2023

ceithir commented Oct 10, 2023

ceithir commented Oct 11, 2023

weeklies commented Oct 12, 2023

ceithir commented Oct 9, 2023 •

edited

Loading