upload-bin fails to upload index files #3658

Closed
AnnaShaleva opened this issue Nov 2, 2024 · 18 comments
Labels
bug Something isn't working I4 No visible changes S4 Routine U1 Critically important to resolve quickly

@AnnaShaleva
Member


2024-11-02 11:03:54.592	Chain block height: 6284055
2024-11-02 11:03:56.388	Chain block height: 4902100
2024-11-02 11:03:58.730	Chain block height: 6621405
2024-11-02 11:04:01.577	Chain block height: 4504542
2024-11-02 11:06:50.445	Uploading index files...
2024-11-02 11:06:50.445	First block of latest incomplete batch uploaded to NeoFS container: 6250000
2024-11-02 11:06:56.839	Uploading index files...
2024-11-02 11:06:56.839	First block of latest incomplete batch uploaded to NeoFS container: 4460000
2024-11-02 11:07:49.082	Uploading index files...
2024-11-02 11:07:49.082	First block of latest incomplete batch uploaded to NeoFS container: 4870000
2024-11-02 11:08:53.590	Uploading index files...
2024-11-02 11:08:53.590	First block of latest incomplete batch uploaded to NeoFS container: 6590000
2024-11-02 13:13:45.533	Index file 0 generated, checking for the missing blocks...
2024-11-02 13:13:45.562	0 missing block(s) processed for index file 0, uploading index file...
2024-11-02 13:13:52.846	Index file 0 generated, checking for the missing blocks...
2024-11-02 13:13:57.482	Uploaded index file 0
2024-11-02 13:14:13.310	Index file 0 generated, checking for the missing blocks...
2024-11-02 13:14:19.667	failed to upload index files: block 89052 is missing from the storage
2024-11-02 13:14:22.703	Index file 0 generated, checking for the missing blocks...
2024-11-02 13:14:40.243	failed to upload index files: block 91326 is missing from the storage
2024-11-02 13:15:21.224	Chain block height: 6284540
2024-11-02 13:15:42.791	Chain block height: 4505042
2024-11-02 13:15:50.891	failed to upload index files: block 27475 is missing from the storage
2024-11-02 13:16:52.950	Chain block height: 4902596
2024-11-02 13:18:49.521	Uploading index files...

Blocks 91326 (FS testnet), 89052 (mainnet) and 27475 (testnet) can't be found in the storage. Check whether something is wrong with these blocks or whether the number of retries is simply too small.

@AnnaShaleva AnnaShaleva added bug Something isn't working U1 Critically important to resolve quickly S4 Routine I4 No visible changes labels Nov 2, 2024
@AnnaShaleva AnnaShaleva added this to the v0.107.0 milestone Nov 2, 2024
@AnnaShaleva
Member Author

Within the scope of this issue, let's turn the maxRetries constant into a CLI parameter.

@AliceInHunterland
Contributor

Block 91326 (FS testnet) exists:

(base) ekaterinapavlova@MacBook-Air-4 neofs-node % ./bin/neofs-cli object search --ttl 1 -r st2.storage.fs.neo.org:8080 -w /Users/ekaterinapavlova/Workplace/neo-go/panelwallet1.json --filters 'Block EQ 91326' --cid FidCcVzUxzY3PeUksCnnpUQ8h8PJeYPessX8FRMuveSy

Found 1 objects.
HwkGuEX5bra2et7pm2CD2fs4vi9oqfrSrKjkX4GBc6m5

@AliceInHunterland
Contributor

89052 (mainnet) - not found.

(base) ekaterinapavlova@MacBook-Air-4 neofs-node % ./bin/neofs-cli object search --ttl 1 -r st1.storage.fs.neo.org:8080 -w /Users/ekaterinapavlova/Workplace/neo-go/panelwallet1.json --filters 'Block EQ 89052' --cid 7a1cn9LNmAcHjESKWxRGG7RSZ55YHJF6z2xDLTCuTZ6c -t 60s

Found 0 objects.
(base) ekaterinapavlova@MacBook-Air-4 neofs-node % ./bin/neofs-cli object search --ttl 1 -r st2.storage.fs.neo.org:8080 -w /Users/ekaterinapavlova/Workplace/neo-go/panelwallet1.json --filters 'Block EQ 89052' --cid 7a1cn9LNmAcHjESKWxRGG7RSZ55YHJF6z2xDLTCuTZ6c -t 60s

Found 0 objects.
(base) ekaterinapavlova@MacBook-Air-4 neofs-node % ./bin/neofs-cli object search --ttl 1 -r st3.storage.fs.neo.org:8080 -w /Users/ekaterinapavlova/Workplace/neo-go/panelwallet1.json --filters 'Block EQ 89052' --cid 7a1cn9LNmAcHjESKWxRGG7RSZ55YHJF6z2xDLTCuTZ6c -t 60s

Found 0 objects.
(base) ekaterinapavlova@MacBook-Air-4 neofs-node % ./bin/neofs-cli object search --ttl 1 -r st4.storage.fs.neo.org:8080 -w /Users/ekaterinapavlova/Workplace/neo-go/panelwallet1.json --filters 'Block EQ 89052' --cid 7a1cn9LNmAcHjESKWxRGG7RSZ55YHJF6z2xDLTCuTZ6c -t 60s

Found 0 objects.

@AnnaShaleva
Member Author

Block 91326 (FS testnet) exists:

Can you GET/HEAD it?

89052 (mainnet) - not found.

What if we try requesting it more times?

@AliceInHunterland
Contributor

Can you GET/HEAD it?

I don't know the OID, so I can't.

@AliceInHunterland
Contributor

AliceInHunterland commented Nov 2, 2024

What if we try requesting it more times?

Still nothing.

@AnnaShaleva
Member Author

Block 91326 (FS testnet) exists:

I don't know the OID, so I can't.

You have it, it's HwkGuEX5bra2et7pm2CD2fs4vi9oqfrSrKjkX4GBc6m5. The question is about the existing object.

@AliceInHunterland
Contributor

AliceInHunterland commented Nov 2, 2024

HwkGuEX5bra2et7pm2CD2fs4vi9oqfrSrKjkX4GBc6m5

With this one everything is okay: the object is correct and is returned. I ran GET/HEAD several times from different nodes, with and without --ttl 1.

@AliceInHunterland
Contributor

AliceInHunterland commented Nov 2, 2024

For --cid 7a1cn9LNmAcHjESKWxRGG7RSZ55YHJF6z2xDLTCuTZ6c I checked blocks from 0 to 350k and got the following results (96 missing blocks):

Block 89052 is missing from the storage
Block 89131 is missing from the storage
Block 89151 is missing from the storage
Block 89152 is missing from the storage
Block 89160 is missing from the storage
Block 89171 is missing from the storage
Block 89180 is missing from the storage
Block 89238 is missing from the storage
Block 89260 is missing from the storage
Block 89263 is missing from the storage
Block 89278 is missing from the storage
Block 89280 is missing from the storage
Block 89318 is missing from the storage
Block 89323 is missing from the storage
Block 89327 is missing from the storage
Block 89329 is missing from the storage
Block 89332 is missing from the storage
Block 89333 is missing from the storage
Block 89338 is missing from the storage
Block 89352 is missing from the storage
Block 89373 is missing from the storage
Block 89380 is missing from the storage
Block 89385 is missing from the storage
Block 89389 is missing from the storage
Block 89392 is missing from the storage
Block 89398 is missing from the storage
Block 89400 is missing from the storage
Block 89408 is missing from the storage
Block 89413 is missing from the storage
Block 89443 is missing from the storage
Block 89445 is missing from the storage
Block 89493 is missing from the storage
Block 89494 is missing from the storage
Block 89500 is missing from the storage
Block 89503 is missing from the storage
Block 89520 is missing from the storage
Block 89533 is missing from the storage
Block 89547 is missing from the storage
Block 89553 is missing from the storage
Block 89554 is missing from the storage
Block 89558 is missing from the storage
Block 89560 is missing from the storage
Block 89567 is missing from the storage
Block 89580 is missing from the storage
Block 89585 is missing from the storage
Block 89586 is missing from the storage
Block 89589 is missing from the storage
Block 89598 is missing from the storage
Block 89599 is missing from the storage
Block 89600 is missing from the storage
Block 89606 is missing from the storage
Block 89628 is missing from the storage
Block 89633 is missing from the storage
Block 89636 is missing from the storage
Block 89638 is missing from the storage
Block 89645 is missing from the storage
Block 89655 is missing from the storage
Block 89664 is missing from the storage
Block 89668 is missing from the storage
Block 89674 is missing from the storage
Block 89685 is missing from the storage
Block 89687 is missing from the storage
Block 89693 is missing from the storage
Block 89694 is missing from the storage
Block 89698 is missing from the storage
Block 89708 is missing from the storage
Block 89709 is missing from the storage
Block 89716 is missing from the storage
Block 89725 is missing from the storage
Block 89733 is missing from the storage
Block 89739 is missing from the storage
Block 89745 is missing from the storage
Block 89746 is missing from the storage
Block 89749 is missing from the storage
Block 89756 is missing from the storage
Block 89802 is missing from the storage
Block 89804 is missing from the storage
Block 89817 is missing from the storage
Block 89824 is missing from the storage
Block 89839 is missing from the storage
Block 89846 is missing from the storage
Block 89858 is missing from the storage
Block 89861 is missing from the storage
Block 89862 is missing from the storage
Block 89864 is missing from the storage
Block 89884 is missing from the storage
Block 89897 is missing from the storage
Block 89901 is missing from the storage
Block 89921 is missing from the storage
Block 89924 is missing from the storage
Block 89956 is missing from the storage
Block 89958 is missing from the storage
Block 89959 is missing from the storage
Block 89961 is missing from the storage
Block 89966 is missing from the storage
Block 89997 is missing from the storage
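
A range check like the one above can be sketched in Go as follows. This is only an illustration: `findMissing` and the `exists` callback are hypothetical names, and the callback stands in for whatever search request is used against the container (for example, a search filtered by the block attribute). The in-memory map in `main` is test data, not real storage.

```go
package main

import "fmt"

// findMissing checks every block index in [from, to) with the supplied
// exists callback and returns the indexes for which it reports false.
// It stops at the first callback error and returns what it has found so far.
func findMissing(from, to uint32, exists func(uint32) (bool, error)) ([]uint32, error) {
	var missing []uint32
	for i := from; i < to; i++ {
		ok, err := exists(i)
		if err != nil {
			return missing, fmt.Errorf("block %d: %w", i, err)
		}
		if !ok {
			missing = append(missing, i)
		}
	}
	return missing, nil
}

func main() {
	// Stand-in for the container: every block is present except these two.
	absent := map[uint32]bool{89052: true, 89131: true}
	exists := func(i uint32) (bool, error) { return !absent[i], nil }

	missing, err := findMissing(89050, 89140, exists)
	if err != nil {
		fmt.Println("scan failed:", err)
		return
	}
	for _, i := range missing {
		fmt.Printf("Block %d is missing from the storage\n", i)
	}
}
```

Keeping the storage lookup behind a callback means the same loop can be reused with a retrying lookup, which matters here because a single empty search result does not prove the object is gone.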

AliceInHunterland added a commit that referenced this issue Nov 2, 2024
AliceInHunterland added a commit that referenced this issue Nov 5, 2024
AliceInHunterland added a commit that referenced this issue Nov 5, 2024
AliceInHunterland added a commit that referenced this issue Nov 5, 2024
AliceInHunterland added a commit that referenced this issue Nov 5, 2024
@AliceInHunterland
Contributor


2024-11-05 13:39:12.765 | Uploaded index file 26
2024-11-05 13:39:11.254 | 0 missing block(s) processed for index file 26, uploading index file...
2024-11-05 13:39:11.228 | Index file 26 generated, checking for the missing blocks...
2024-11-05 12:46:05.259 | Uploading index files...
2024-11-05 12:46:05.259 | First block of latest incomplete batch uploaded to NeoFS container: 6590000
2024-11-05 12:43:02.745 | Chain block height: 6638208
2024-11-05 12:41:59.568 | failed to upload index files: block 3349950 is missing from the storage
2024-11-05 06:40:39.327 | Index file 26 generated, checking for the missing blocks...
2024-11-05 05:17:15.621 | Uploading index files...
2024-11-05 05:17:15.621 | First block of latest incomplete batch uploaded to NeoFS container: 6580000
2024-11-05 05:12:19.370 | Chain block height: 6636493
2024-11-05 05:10:47.285 | failed to upload index files: block 3378959 is missing from the storage
2024-11-05 00:53:58.817 | Index file 26 generated, checking for the missing blocks...
2024-11-05 00:07:24.780 | Uploaded index file 25


After three attempts, all blocks were found. Depends on nspcc-dev/neofs-node#2721.

AliceInHunterland added a commit that referenced this issue Nov 6, 2024
AliceInHunterland added a commit that referenced this issue Nov 6, 2024
@AnnaShaleva
Member Author

For the record: some blocks are really missing from NeoFS. We have 18ed3bb, which should solve this issue, but we're not 100% sure that it helps. Let's keep this issue open, test the updated upload script on mainnet one more time, and check whether it fixes the gaps in block uploading.

@AliceInHunterland
Contributor

It seems to be still relevant, based on new uploader runs:

2024-11-11 07:18:23.373 | failed to upload index files: index file 0: block 1618 is missing from the storage
2024-11-11 07:17:54.825 | Index file 0: fetching missing block 1618


@roman-khimov
Member

How does it really happen? You're uploading an object for block 1618 and... get an error? get a successful reply? don't get any reply? Can we get more data from logs in this case?

@AliceInHunterland
Contributor

How does it really happen? You're uploading an object for block 1618 and... get an error? get a successful reply? don't get any reply? Can we get more data from logs in this case?

According to the logs, it was uploaded successfully; there were no related errors that could explain this.

@roman-khimov
Member

And we don't know its OID? I think we need some verbose mode with more data in this case to trace the problem. Seems like it happens often enough and we don't need to wait for full upload cycle to complete.

@AliceInHunterland
Contributor

And we don't know its OID? I think we need some verbose mode with more data in this case to trace the problem. Seems like it happens often enough and we don't need to wait for full upload cycle to complete.

No OID. #3655 will serve as a verbose mode: it will put OIDs directly into the index file. Alternatively, we can write everything to the logs to see it faster; 200k blocks should be enough to catch the problem.

@roman-khimov
Member

Log line is much easier to add, let's add it and then try a new upload, see where it fails and then take a look at it from the NeoFS side. Either it loses an object that was accepted (bad), or it reports an incorrect status for failed upload (bad). Or something else. Maybe related to nspcc-dev/neofs-node#2975, maybe not.

AliceInHunterland added a commit that referenced this issue Nov 11, 2024
Logs for testing purpose. Refs. https://github.com//issues/3658#issuecomment-2468210667

Signed-off-by: Ekaterina Pavlova <[email protected]>
@AnnaShaleva
Member Author

The problem is on the NeoFS side: some objects silently fail to upload. It is fixed by nspcc-dev/neofs-node#3014, so I'm closing this issue as not planned; no action is required on the NeoGo side.

@AnnaShaleva AnnaShaleva closed this as not planned Won't fix, can't repro, duplicate, stale Nov 18, 2024