Cannot perform any retrieval for 32GB files #5409
Comments
|
Thank you for the bug reports, it's very helpful when they're so well structured like this 👍 @dgreuel2002 I believe the crash you're seeing is coming from this line:
Is it possible there are messages stuck in the message pool preventing the deal from proceeding?
This sounds like a crash on the miner side, as you suggested. We will investigate all these issues, thanks again for the detailed report. |
@dirkmc do you mean his deal price for "retrieval" is free? How do you set the retrieval price anyway? Is it always the same as the storage price for unverified deals? I looked at the help for "lotus-miner storage set-ask" but there's no mention of retrieval. USAGE: OPTIONS: |
@kernelogic @dirkmc Yes, I do have it set to 0, with the price per byte set to .0000000000002. I have no messages stuck: the rest of the miner operations are fine and the local mpool is clear. The miner abruptly halts as soon as the unseal message hits it. The log I attached is the end of the miner log at the point it halted, and it doesn't indicate why. Please let me know if you need more info on this. @stuberman also gets this same error when he runs lotus-miner retrieval-deals list. I am on the road, but please reach out if you need more info. |
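For a sense of scale, here is a rough sketch of what that per-byte price would add up to for one of these deals. It assumes the quoted .0000000000002 is denominated in FIL (200,000 attoFIL per byte) and that the full 32 GiB is transferred, so it is an upper bound; `retrievalCost` is a hypothetical helper, not a lotus function.

```go
package main

import (
	"fmt"
	"math/big"
)

// attoPerFIL is the number of attoFIL in one FIL (1e18).
const attoPerFIL = 1_000_000_000_000_000_000

// retrievalCost is a hypothetical helper: total cost in attoFIL for
// transferring sizeBytes at pricePerByteAtto attoFIL per byte.
func retrievalCost(sizeBytes, pricePerByteAtto int64) *big.Int {
	return new(big.Int).Mul(big.NewInt(sizeBytes), big.NewInt(pricePerByteAtto))
}

func main() {
	size := int64(32) << 30 // 32 GiB, assumed to be fully transferred
	// Assumption: 0.0000000000002 FIL per byte = 200,000 attoFIL per byte.
	cost := retrievalCost(size, 200_000)

	// Convert attoFIL to FIL exactly, using a rational number.
	fil := new(big.Rat).SetFrac(cost, big.NewInt(attoPerFIL))
	fmt.Println(cost.String(), "attoFIL =", fil.FloatString(13), "FIL")
}
```

Under those assumptions the transfer itself would cost well under 0.01 FIL, so the price level alone is unlikely to be what blocks the deal.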
@dirkmc @kernelogic @jennijuju |
I am storing both sealed and unsealed sectors. In my miner logs I see attempts to retrieve deals, both from unsealed sectors and from sealed sectors, and they all seem to fail. Two sectors were transferred to my worker/sealing repo to unseal and are still sitting there idle (sectors 305 and 440). My miner logs also show an unsealed retrieval attempt (sector 330), which also fails. Here is a sample of logs for sector 330 on the miner
Logs for sector 440 from miner
Logs from lotus-worker for 440
|
That aligns with my attempts, @stuberman. They all end up with "deal data transfer failed: incomplete response". |
Have you tried with fast-retrieval=true deals? |
fast-retrieval=true just sends you a local copy of the file, which doesn't require any computation. |
I'm going to try now. Let you know. |
No, I haven't tried. I didn't want to waste too much of my providers' space, so all my slingshot deals were sent with fast-retrieval=false. Now this decision might come back to bite me, as the slingshot judges might deduct rewards for failed retrievals. |
seems like miners are now blocking retrieval deals to protect their miners ... |
We have the same unmarshalling error issue.
|
I am still getting this when I run "lotus-miner retrieval-deals list". It started after @kernelogic sent me the deal described in this post and it crashed my miner. Has there been any movement on this, or should I open a separate issue? @s0nik42 |
@dgreuel2002 the fix is in #5493, however this has not been released yet. It should go out in the next lotus release. |
@dirkmc are you talking about
|
@s0nik42 I have updated to the new version and am still getting garbage output from lotus-miner retrieval-deals list. |
Yes, I'm not sure these 2 issues are related. |
Sorry, I didn't look at the log, I just read your message. #5493 should fix the crash when the deal price per byte is zero. |
We believe the unsealing issue is in the proofs rather than in lotus; it is tracked in filecoin-project/rust-fil-proofs#1402. |
I am seeing 'Stuck after DealStatusOngoing' on my miner, when trying to retrieve a file that is already unsealed. The miner tries to unseal it again. lotus-miner version 1.5.2+mainnet+git.cd293aa72.dirty Client:
miner.log:
~# lotus-miner sealing jobs
~# lotus-miner storage find 1369
UPDATE: UPDATE2: |
with lotus 1.5.0 Mar 17 15:42:11 miner7 lotus-miner[1701810]: 2021-03-17T15:42:11.349Z INFO markets loggers/loggers.go:30 retrieval provider event {"name": "ProviderEventUnsealComplete", "deal ID": "12", "receiver": "12D3KooWLb3vya3V3ZU39jZRfVsTEY6KB93czht8AdVAF3pMCgqA", "state": "DealStatusUnsealed", "message": ""} |
@kernelogic - any updates on this based on changes introduced in 1.6 and 1.7? Is this still an ongoing problem? |
Going to close this out for the time being. Please open a new Issue for specific issues you may still be experiencing. |
Describe the bug
I tried several retrievals from several different miners for the deals I made. None of them went through. I do not believe they are all offline. I suspect it has something to do with the deal size. These are all 32GB offline deals, made with --fast-retrieval=false.
Example 1 - stuck after DealStatusOngoing
The provider is unsealing, but it has taken hours without progressing any further
lotus client retrieve --miner f024008 bafykbzaced6noeziyglm2frycaknydeijbimvhpk72eimhfupoxd72dehp2k4 abc
Example 2 - unmarshalling error
The provider told me his miner crashed upon receiving my retrieval request
lotus client retrieve --miner f083550 bafykbzacecu4qt4tlr5vqojgtlt553rbskqremq67z37x5cufaonjqfh5y7no mysql-2016-04-19.tar.gz.partbl
2021-01-25T04:59:20.145Z WARN rpc [email protected]/client.go:541 unmarshaling failed {"message": "{"Err":"exhausted 5 attempts but failed to open stream, err: failed to dial 12D3KooWFJ6iPAiW82pR7REB8pJfKAsoKoEAhgWKYduoQT1734a9: all dials failed\n * [/ip4/127.0.0.1/tcp/45989] dial tcp4 127.0.0.1:45989: connect: connection refused\n * [/ip6/::1/tcp/41337] dial tcp6 [::1]:41337: connect: connection refused\n * [/ip4/59.12.56.212/tcp/45989] dial tcp4 59.12.56.212:45989: connect: connection refused\n * [/ip4/59.12.56.215/tcp/45989] dial tcp4 0.0.0.0:33463-\u003e59.12.56.215:45989: i/o timeout","Root":null,"Piece":null,"Size":0,"MinPrice":"\u003cnil\u003e","UnsealPrice":"\u003cnil\u003e","PaymentInterval":0,"PaymentIntervalIncrease":0,"Miner":"f083550","MinerPeer":{"Address":"f083550","ID":"12D3KooWFJ6iPAiW82pR7REB8pJfKAsoKoEAhgWKYduoQT1734a9","PieceCID":null}}"}
ERROR: RPC client error: unmarshaling result: failed to parse big string: '"\u003cnil\u003e"'
Example 3 - incomplete response
One of the providers told me his miner is set to allowUnseal=false but his worker is set to allowUnseal=true, yet the retrieval failed before the worker picked up the unseal.
lotus client retrieve --miner f064218 baga6ea4seaqgsbrsupr6az5zf2yjqai5t4xtgn7lf2zd62wnxxq2mgyhlvbxyda publicdomainmovies.tar.05
lotus client retrieve --miner f01278 bafykbzaceaj7ube4k2vhniqgdb6vq7ggcoebf4c6punse7ike2almme6esdxi mysql-2018-11-01.tar.gz.partcy
Example 4 - miner is not accepting online retrieval deals
lotus client retrieve --miner f047419 baga6ea4seaqgsbrsupr6az5zf2yjqai5t4xtgn7lf2zd62wnxxq2mgyhlvbxyda publicdomainmovies.tar.05
Example 5 - normal shutdown of state machine
This provider is able to serve my retrieval for an 8GB file, but not for 32GB ones.
lotus client retrieve --miner f022352 baga6ea4seaqoyyc52ajdq2v7723d3uixeem5ulwvbtv7h45776vrrg53nnf2gjq trusted_setup_phase2.tar.47
lotus version 1.4.1+git.d6c06881e