-
Notifications
You must be signed in to change notification settings - Fork 999
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Online store latency (redis) is growing over time #3597
Comments
I have noticed that timestamps of incremental materialization are stored in the registry and are sent to client as well. We run incr. mat. every 15 min, so over month it yields to 30 * 24 * 4 = 2880 timestamps per view, which might explain the gradual increase of the response time. Not sure if it is the reason, but decided to share the info here. |
Thanks for filing! do you mean that you changed from the file based registry to have a TTL=0 and it was ok? |
I have never adjusted registry cache TTL, it has always been set to the default value. |
Hi guys. Any news on this? This issue forces us to update the registry file in production every couple of weeks to reset the latency.. |
Hi @adchia |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
we found that logging to feast caused the problem, we disabled it and everything runs without leaks now |
@RadionBik hey, thanks for investigating this, can you clarify the last comments? Did you find that both incremental materialization payload and |
I might have left the last comment in a wrong issue. So the current status is:
Hope this clarifies the situation a bit |
@RadionBik thanks for the clarification |
Expected Behavior
The online-store latency remains the same and doesn't grow over the time. We assume that amount of data in redis is not growing.
Current Behavior
The median latency growth over the past 30 days is on the chart above.
Steps to reproduce
We use a custom aiohttp python service to relay requests to the online-store, where we invoke the native feast client. The service is completely stateless, and I don't expect it to be the source of the problem.
here is our feast config:
as you see, we use the default registry cache TTL (=600).
Specifications
Possible Solution
We noticed that changing the path to file-based registry (i.e. effectively re-creating it) eliminates the latency growth and it back to normal (today's chart):
Therefore, a solution might be related to fixing the registry caching mechanism.
Let me know if further details are needed!
The text was updated successfully, but these errors were encountered: