-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dangling leases problem #265
Comments
@TormenTeDx - have you attempted to heal the dangling leases by executing the script referenced in this doc: https://akash.network/docs/providers/provider-faq-and-guide/#dangling-deployments While the title of this section is "Dangling Deployments" it actually cures both dangling deployments (I..e when a lease has been closed but the deployment in K8s remains) and dangling leases (I.e. when the lease on chain is active but no deployment exists in K8s.). This is a direct link to the suggested script to cure both scenarios: https://gist.github.com/andy108369/f211bf6c06f2a6e3635b20bdfb9f0fca While this is not a permanent fix - it should cure the current issue and close out all stale leases for your provider on the blockchain. And within that script this is the section that address dangling leases. Run the script as is and in it's entirety. Just isolating this section below for knowledge of the relevant section.
|
Yes, it found orphaned leases. I closed all of them. My numbers look good now |
OK here is some extra info. 100% it happens when restarting the pod. |
@TormenTeDx - could you provide precise, ordered steps to re-create the issue? And please confirm the Akash provider version active on provider. For example: Step 1 - launch a number of deployments using XYZ SDL (provide link to SDL) Step 2 - restart the provider pod Step 3 - observed behavior has been the deployments are now closed but lease remain on chain Believe above steps capture those that should be used to reproduce the issue. But wanted to ensure. And also want to ensure provider version and SDL used. As I believe there was a thought prior that this could be related to specific SDL/deployment types. Want to first focus on provider functionality and can later focus on reporting issues if need be. |
pod restart and all leases lost... running triple control planes and triple etcd on individual servers if memory serves, then in the past this issue was sometimes caused by rpc nodes or control planes if they was immediately lost. my provider pod didn't restart excessively.... deleting the dangling is hardly a real fix... it can take months for providers to accumulate customers. if we imagine 3 months to fill a provider, then this happens and one had to start over. due to such an event. i'm not currently aware of any obvious path to creating dangling leases. |
there are actually 2 ways it can happen.
I dont know why it happens. It's just sometimes it happens. One day I scaled down/up provider just to check something and like 80% of leases went dangling. I tried doing the same thing 1h later but nothing happened. |
I think yesterday my provider pod restarted and then I lost all my current leases. They were gone from the provider, but they're still
active on the chain and the new leases work fine, but the old leases are gone from my provider and they are dangling leases on the chain.
Please look at the number of leases on the screenshot.
The graph shows 30 and on the top value shows 15. 15 is the number of leases I currently have on my provider - which means I have them as pods. The 30 is the number of leases there are currently on the chain. Which means 30-15 = 15 - there is 15 dangling leases on the chain on my provider. I checked on akashdash and it shows 30.
and they won't close when they will run out of $
they will only close when you withdraw - manually or automatically, and there is no more $ on the lease then it will close
but when you setup like big withdrawal time like 100h for example to get paid once a week
they will be active all the time. The only problem is they lock your 0.5 akt deposit bid fee.
The lease will stay active and the $ wont be paid untill withdrawal time.
I just ran manual withdrawal for all these leases, and few of them got closed right after.
dunno when it excatly happened, but I know the pod restarted for some reason and then after that I had no leases on my provider
I don't know how to excatly reproduce this. I know that it happened right after the pod restarted. It happened before occasionaly - maybe I had like 2-3 dangle leases in past 6 months, but it's never been like that where all the leases were just gone.
I noticed it happens also on other providers, whenever there's a difference in these two numbers then 100% there are some dangling leases on the chain and Im pretty sure this happens during the restart of the provider pod
The text was updated successfully, but these errors were encountered: