Improve the backup schedule cron job so that only one backup runs at a time. #85

calind · 2018-07-23T16:20:56Z

Controller should not rely on an internal lock, but rather check if there's an active job
https://github.com/presslabs/mysql-operator/blob/e0ef8bb900001a91db4aa0e78596a8f8a0bf9d12/pkg/controller/clustercontroller/backups.go#L101-L102
In case of an active job, we should create an automated backup, but mark it as failed, with the reason that another backup is running

cu12 · 2018-10-09T12:13:06Z

@calind I concur, I recently saw some weird issues due to this

I had a cluster down, but the operator kept starting the new backup jobs and that resulted in two errors in our environment which I believe are connected to this.

We ran into sshd.socket stops working after a while coreos/bugs#2181 on nodes, where these pods were scheduled (gazillion pods every second)
Apps that were talking to php-fpm on these nodes were stuck after some time

It's a long shot, but either some back-off and/or mechanism to defer the backup job when cluster is not in ready state would be nice

HBO2 · 2018-10-18T07:27:38Z

@calind I totally agree with @cu12. It takes a while to have the cluster up and running and I had to some big issues with a multitude of failing pods based on the backup jobs.

And also when the S3 is nog configures correctly for some reason, you will get an immense amount of pods.

thx

calind added this to the 0.2.x milestone Jul 23, 2018

AMecea modified the milestones: 0.2.x, 0.2.6 Feb 25, 2019

AMecea modified the milestones: 0.2.6, 0.2.7 Mar 4, 2019

AMecea mentioned this issue Mar 14, 2019

Refactor backup cronjob #255

Merged

AMecea added the in progress label Mar 14, 2019

calind closed this as completed in #255 Mar 21, 2019

AMecea removed the in progress label Mar 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve the backup schedule cron job so that only one backup runs at a time. #85

Improve the backup schedule cron job so that only one backup runs at a time. #85

calind commented Jul 23, 2018 •

edited by milero

Loading

cu12 commented Oct 9, 2018

HBO2 commented Oct 18, 2018 •

edited

Loading

Improve the backup schedule cron job so that only one backup runs at a time. #85

Improve the backup schedule cron job so that only one backup runs at a time. #85

Comments

calind commented Jul 23, 2018 • edited by milero Loading

cu12 commented Oct 9, 2018

HBO2 commented Oct 18, 2018 • edited Loading

calind commented Jul 23, 2018 •

edited by milero

Loading

HBO2 commented Oct 18, 2018 •

edited

Loading