Improve performance of garbage collection job #4780

Closed
bjester opened this issue Oct 8, 2024 · 1 comment

bjester commented Oct 8, 2024

Background

We run the garbage_collect management command every day via a Kubernetes cron job.

Observed behavior

The management command gets stuck on the following query. In production it ran for at least 12 hours with no indication that it would ever complete.

UPDATE "contentcuration_contentnode" 
SET "parent_id" = '00000000000000000000000000000000' 
WHERE (
  SELECT U0."id" 
  FROM "contentcuration_channel" U0 
  INNER JOIN "contentcuration_contentnode" U1 ON (U0."main_tree_id" = U1."id") 
  WHERE U1."tree_id" = "contentcuration_contentnode"."tree_id" LIMIT 1
) IN (
  '<REDACTED>', 
  '<REDACTED>', 
  '<REDACTED>', 
  '<REDACTED>'
)

Expected behavior

The management command and the queries it produces should be optimized for our large Studio database. In particular, it should avoid correlated subqueries like the one above, which is re-evaluated for every content node in the result set.
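To illustrate the optimization direction (this is a sketch, not the actual fix that landed; the schema is heavily simplified and the `GARBAGE_ROOT` / `deleted_channel_ids` names are hypothetical), the channel-to-tree lookup can be resolved once into a flat list of `tree_id` values, and the `UPDATE` then becomes a simple indexed `IN` filter with no per-row subquery. A minimal runnable demonstration using sqlite3 (Studio itself uses Django on PostgreSQL):

```python
import sqlite3

# Minimal stand-in schema for the two tables in the slow query
# (columns simplified; the real Studio schema has many more fields).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE contentcuration_channel (
    id TEXT PRIMARY KEY,
    main_tree_id TEXT
);
CREATE TABLE contentcuration_contentnode (
    id TEXT PRIMARY KEY,
    tree_id INTEGER,
    parent_id TEXT
);
""")

# One channel whose main tree root is node "n1" (tree 7),
# plus nodes in that tree and one node in an unrelated tree.
conn.execute("INSERT INTO contentcuration_channel VALUES ('chan1', 'n1')")
conn.executemany(
    "INSERT INTO contentcuration_contentnode VALUES (?, ?, ?)",
    [("n1", 7, None), ("n2", 7, "n1"), ("n3", 8, None)],
)

GARBAGE_ROOT = "00000000000000000000000000000000"  # hypothetical constant
deleted_channel_ids = ["chan1"]  # channels being garbage-collected

# Step 1: resolve the channel -> main tree lookup ONCE, up front,
# yielding a flat list of tree_ids.
placeholders = ",".join("?" * len(deleted_channel_ids))
tree_ids = [
    row[0]
    for row in conn.execute(
        f"""SELECT n.tree_id
            FROM contentcuration_channel c
            JOIN contentcuration_contentnode n ON c.main_tree_id = n.id
            WHERE c.id IN ({placeholders})""",
        deleted_channel_ids,
    )
]

# Step 2: a plain UPDATE against the tree_id column -- no correlated
# subquery evaluated for each candidate row.
placeholders = ",".join("?" * len(tree_ids))
conn.execute(
    f"""UPDATE contentcuration_contentnode
        SET parent_id = ?
        WHERE tree_id IN ({placeholders})""",
    [GARBAGE_ROOT, *tree_ids],
)
conn.commit()

moved = conn.execute(
    "SELECT id FROM contentcuration_contentnode WHERE parent_id = ? ORDER BY id",
    (GARBAGE_ROOT,),
).fetchall()
print(moved)  # nodes n1 and n2 (tree 7) now hang off the garbage root
```

The two-step shape also keeps each statement short-lived, which matters for the transaction-length concern described below.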

User-facing consequences

Since the queries occur within a transaction, a very long-running transaction modifying the content node table can cause issues and timeouts across Studio, leading to indirect Sentry errors.

Additionally, I had to manually kill the query in Cloud SQL.

Steps to reproduce the issue

  1. Restored production database to hotfixes environment
  2. Manually started the job
  3. Observed running queries

bjester commented Nov 14, 2024

Running the job with fix #4808 on the hotfixes server has proceeded well past the previous point of failure.

2 participants