Use TimerTask in the 3scale batcher policy #786

davidor · 2018-06-22T16:26:21Z

This PR replaces the usages of ngx.timer.every with TimerTask instances.

This solves the issue of not being able to cancel timers. Recurrent tasks created using TimerTask can be canceled, but timers created with ngx.timer.every cannot. This was causing a bug: timers would continue running even after a config reload.
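For context, here is a minimal sketch of why a recurrent task built on one-shot timers can be canceled: each run re-arms itself only while the task is still active. This is not the actual resty/concurrent/timer_task.lua code. `CancellableTask`, `fake_timer_at` and `tick` are hypothetical names, and the fake scheduler stands in for ngx.timer.at(delay, callback, ...) so that the snippet runs in plain Lua.

```lua
-- Hypothetical sketch, NOT the real TimerTask module.
local unpack = table.unpack or unpack -- Lua 5.1/LuaJIT and 5.2+ compatible

local CancellableTask = {}
CancellableTask.__index = CancellableTask

-- timer_at is injected; in OpenResty it would be ngx.timer.at (assumption).
function CancellableTask.new(timer_at, interval, fn, ...)
  return setmetatable({
    timer_at = timer_at,
    interval = interval,
    fn = fn,
    args = { n = select('#', ...), ... },
    active = false,
  }, CancellableTask)
end

-- Single shared callback: the first parameter mirrors the "premature" flag
-- that ngx.timer.at passes to its callbacks.
local function run(premature, self)
  if premature or not self.active then return end -- canceled: do not re-arm
  self.fn(unpack(self.args, 1, self.args.n))
  self.timer_at(self.interval, run, self)         -- schedule the next run
end

function CancellableTask:start()
  self.active = true
  self.timer_at(self.interval, run, self)
end

function CancellableTask:cancel()
  self.active = false
end

-- Fake scheduler so the example runs outside of nginx.
local queue = {}
local function fake_timer_at(_, callback, ...)
  queue[#queue + 1] = { callback = callback, args = { ... } }
  return true
end

-- Fires every timer that is currently queued, exactly once.
local function tick()
  local pending = queue
  queue = {}
  for _, job in ipairs(pending) do
    job.callback(false, unpack(job.args))
  end
end

local runs = 0
local task = CancellableTask.new(fake_timer_at, 1, function() runs = runs + 1 end)
task:start()
tick(); tick()   -- the task runs twice
task:cancel()
tick(); tick()   -- the pending timer fires, sees the task is inactive, and stops
```

With ngx.timer.every there is no equivalent of the `active` check, which is why those timers kept running after a config reload.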

mikz · 2018-06-22T16:44:18Z

gateway/src/apicast/policy/3scale_batcher/3scale_batcher.lua

-local function ensure_report_timer_on(self, service_id, backend)
-  local check_timer = self.semaphore_report_timer:wait(0)
+local function ensure_timer_task_created(self, service_id, backend)
+  local check_timer_task = self.semaphore_report_timer:wait(0)


Can we move this to .new? This policy should not be initialized in the init phase, right?

We can't, because we don't have access to the service ID in .new().

Interesting. Maybe it's better to leave it for later, but I think we should consider changing the data structure to remove the service_id dependency. It should be as simple as creating a timer in new that gets reports from the reports batcher. There should be no need for service_id, because the reports batcher instance is specific to this policy and timer, so it will only ever contain reports from this service. I see the reports batcher actually uses that service_id to do a lock on shmem. Hopefully we can solve that.

You're right. That part can be simplified a lot.

Could we generate some UUID instead of using the service ID? Then every policy would take care of just its own data.

This is related to an idea that we discussed some time ago.

Right now, pending reports are stored in a shared dictionary and every instance of the policy creates a timer (per worker) to report all the pending reports for its service ID.

After introducing TimerTask, I see more clearly that it might be better to adopt a different strategy. We could store the pending reports in a table of the instance, as we know that there'll only be one instance of the policy for a given service ID per Apicast worker. Pros: no locks, simpler code. Cons: possibility of losing reports if a worker dies, which in practice I don't think will be an issue.
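A rough sketch of what the per-instance table could look like, assuming reports are aggregated per credentials and metric. `PendingReports`, `credentials_key` and the `'user_key:abc'` style keys are hypothetical and not taken from the current ReportsBatcher code.

```lua
-- Hypothetical sketch of per-instance storage; not the current ReportsBatcher.
local PendingReports = {}
PendingReports.__index = PendingReports

function PendingReports.new()
  return setmetatable({ by_key = {} }, PendingReports)
end

-- Accumulates usage in a plain Lua table owned by this instance.
-- No shared dictionary is involved, so no lock is taken.
function PendingReports:add(credentials_key, metric, value)
  local usage = self.by_key[credentials_key]
  if not usage then
    usage = {}
    self.by_key[credentials_key] = usage
  end
  usage[metric] = (usage[metric] or 0) + value
end

-- Returns everything accumulated so far and starts a fresh table.
function PendingReports:get_all()
  local reports = self.by_key
  self.by_key = {}
  return reports
end

local pending = PendingReports.new()
pending:add('user_key:abc', 'hits', 1)
pending:add('user_key:abc', 'hits', 2)
pending:add('user_key:xyz', 'hits', 5)

-- What a report job would send to backend.
local flushed = pending:get_all()
```

The trade-off is the one mentioned above: the table lives in the worker's memory, so anything not yet flushed is lost if the worker dies.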

Let's leave this for a future PR.
In this one, I'd like to focus on switching to using TimerTask to be able to cancel timers.

Yep, definitely 👍

mikz · 2018-07-02T13:12:17Z

gateway/src/apicast/policy/3scale_batcher/3scale_batcher.lua

  local usage = context.usage
  local service = context.service
  local service_id = service.id
+  self.service_id = service_id


Why is it being assigned to self?

Sorry, this is a leftover from some test. Same with self.backend.

mikz · 2018-07-02T13:13:35Z

gateway/src/resty/concurrent/timer_task.lua

+function _M:schedule_one(delay)
+  -- Need to wrap the task in a function that discards the first param (the
+  -- "premature" one sent by ngx.timer.at)
+  ngx.timer.at(delay or 0, function(_, ...) self.task(...) end, unpack(self.args))


This is going to allocate a function for each call. Is that necessary?

This no longer applies. The method has been replaced.

mikz · 2018-07-02T13:15:42Z

spec/resty/concurrent/timer_task_spec.lua

+
+        -- Can't check all the arguments of ngx.timer.at because it calls a
+        -- private function, but at least we can check the interval (first arg)
+        assert.equals(interval, ngx_timer_stub.calls[1].vals[1])


I think you can change this to use matchers and verify it was called with some function (matcher.function) and exact second parameter.

But the function is private, so I don't think we can do that.

Matcher can match the type like:

assert.spy(ngx_timer_stub).was_called_with(matcher.is_function(), 1)

http://olivinelabs.com/busted/#matchers

I'm familiar with matchers, but I misunderstood you.

I thought that you wanted to check that ngx_timer_stub was called with a specific function, and I was saying that this was not possible because it's a private one. Now I see that you were suggesting just to check that it's a function, no matter which one.

I added a check for this 👍

mikz · 2018-07-02T13:16:10Z

gateway/src/apicast/policy/3scale_batcher/3scale_batcher.lua

+
+  if self.timer_task then
+    self.timer_task:schedule_one(1)
+  end


And we will rely on cancelling the timer via __gc callbacks?

That works, but let's cancel the timer_task here to be explicit about it.
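To illustrate the idea of canceling explicitly while still not leaving pending reports behind, here is a small sketch: stop the recurrent runs first, then run the report job one last time. `PeriodicReporter`, `on_timer` and `shutdown` are hypothetical names, and this is not how the policy or TimerTask implement it.

```lua
-- Hypothetical sketch: explicit cancel followed by one final run.
local PeriodicReporter = {}
PeriodicReporter.__index = PeriodicReporter

-- report is the function that would send a batch to backend (assumption).
function PeriodicReporter.new(report)
  return setmetatable({ pending = {}, active = true, report = report },
                      PeriodicReporter)
end

function PeriodicReporter:add(item)
  self.pending[#self.pending + 1] = item
end

-- Sends whatever is pending and resets the buffer.
function PeriodicReporter:flush()
  local items = self.pending
  self.pending = {}
  if #items > 0 then self.report(items) end
end

-- What the recurrent timer would call on every interval.
function PeriodicReporter:on_timer()
  if self.active then self:flush() end
end

-- Explicit cancel: no more periodic runs, plus one last flush.
function PeriodicReporter:shutdown()
  self.active = false
  self:flush()
end

local sent = {}
local reporter = PeriodicReporter.new(function(items) sent[#sent + 1] = items end)

reporter:add('report-1')
reporter:on_timer()   -- regular periodic run
reporter:add('report-2')
reporter:shutdown()   -- canceled, but 'report-2' is still sent
reporter:add('report-3')
reporter:on_timer()   -- ignored, the reporter is no longer active
```

Being explicit here means the final run does not depend on when the garbage collector decides to finalize the object.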

mikz · 2018-07-02T13:16:21Z

gateway/src/apicast/policy/3scale_batcher/3scale_batcher.lua

+  -- run before that so we do not leave any pending reports.
+
+  if self.timer_task then
+    self.timer_task:schedule_one(1)


Why a delay of 1 and not 0?

:)

From the point of view of the policy, I think it does not matter. However, busted crashes with a segfault when this is 0. Not sure why exactly. It might be related to how busted collects the garbage after all the tests. It does not crash with a delay of 0.1.

Ok, then this is definitely worth a comment :-D

In the end I changed this and implemented it as we discussed off-line: ba883c1

Unfortunately, the specs were mocking ReportsBatcher and that was hiding a bug. ReportsBatcher:add() was not being called with the correct parameters.


…er to real usage

Before, the tests set a high reporting interval so that a report was not triggered in the middle of the test. The tests relied on calling a specific endpoint to force a report to backend. Now, instead of doing that, we let the policy run report jobs as it would normally do, we aggregate the data we receive in the reporting endpoint, and at the end of the test we verify it. This approach is closer to real usage of the policy.

mikz

👍

davidor requested a review from a team as a code owner June 22, 2018 16:26

octobot assigned davidor Jun 22, 2018

davidor changed the title ~~Use TimerTask in the 3scale batcher policy~~ [WIP] Use TimerTask in the 3scale batcher policy Jun 22, 2018

mikz reviewed Jun 22, 2018


davidor force-pushed the use-timertasks-in-batcher-policy branch 6 times, most recently from 11ccd3d to 511cce3 Compare June 28, 2018 11:11

davidor added 2 commits June 29, 2018 11:05

policy/3scale_batcher: use TimerTask

8749da1

spec/policy/3scale_batcher: adapt specs to use TimerTask

b3e60d7

davidor force-pushed the use-timertasks-in-batcher-policy branch from 511cce3 to bc4c74a Compare June 29, 2018 14:52

davidor changed the title ~~[WIP] Use TimerTask in the 3scale batcher policy~~ Use TimerTask in the 3scale batcher policy Jun 29, 2018

davidor changed the title ~~Use TimerTask in the 3scale batcher policy~~ [WIP] Use TimerTask in the 3scale batcher policy Jun 29, 2018

davidor force-pushed the use-timertasks-in-batcher-policy branch 2 times, most recently from 0b1cddd to 74ba6b1 Compare July 2, 2018 13:03

davidor changed the title ~~[WIP] Use TimerTask in the 3scale batcher policy~~ Use TimerTask in the 3scale batcher policy Jul 2, 2018

mikz reviewed Jul 2, 2018


davidor force-pushed the use-timertasks-in-batcher-policy branch 2 times, most recently from f3d949e to 5398758 Compare July 2, 2018 16:41

davidor added 5 commits July 2, 2018 18:42

resty/concurrent/timer_task: fix params in call to schedule_next()

8f10c0d

policy/3scale_batcher/reporter: fix bug when returning reports

4bb6be5

Unfortunately, the specs were mocking ReportsBatcher and that was hiding a bug. ReportsBatcher:add() was not being called with the correct parameters.

CHANGELOG: add switching to using TimerTask in batching policy

bfe1fb1

resty/concurrent/timer_task: add possibility of running for the last …

d7caa05

…time

policy/3scale_batcher: define __gc to avoid leaving pending reports

184616c

davidor force-pushed the use-timertasks-in-batcher-policy branch from 5398758 to b04d254 Compare July 2, 2018 16:43

mikz approved these changes Jul 2, 2018


davidor merged commit a95ebdd into master Jul 2, 2018

davidor deleted the use-timertasks-in-batcher-policy branch July 2, 2018 17:58
