generated from NASA-PDS/template-repo-python
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* [skeleton] implement reindexer sweeper * update metadata flag to use date instead of boolean * [revert me] add test init code * implement reindexer sweeper, with types resolved from the *-dd index, or as "keyword" if not present there Manual tests against docker registry: - logged counts are correct - missing mappings are added - types of missing mappings change according to the resolved typename - metadata updates are written to db - metadata updates are sufficient to trigger re-index, causing previously-unsearchable properties to become searchable. - presence of metadata attribute excludes document from document set on subsequent runs * implement harvest-time filter to ensure that products harvested mid-sweep do not erroneously get flagged as processed * implement logging of problematic harvest timestamp span and harvest software versions * clean up code * remove test code * improve comment * add mypy ignores - None-guard is provided by conditionals * add registry-dd to allowed index types for resolve_multitenant_index_name() * ensure reindexer sweeper captures all relevant documents with a single sweep * improve logging * implement batching approach in reindexer sweeper * squash! implement batching approach in reindexer sweeper * [weeeird bugfix] Patch apparent issues when paginating. See comments * map special-case properties onto their types and incorporate them into the resolution logic * implement stall while update indexing queue is backed up * disable noisy log * tweak stall time/log * make reindexer hits count more human-friendly in logs * clean up logging * exclude ops:Provenance* properties from canonical_type_undefined_property_names * bump hits_stall_tolerance from 5% to 10% of batch_size_limit this should prevent unintended continuous looping * fix stall logic * fix format_hits_count() * change type hint to indicate that consumable iterators are not appropriate this is because the retry would pass the consumed iterator to subsequent calls * Incorporate detection/log/retry of HTTP429 (circuit-breaking throttle) * remove manual stall logic * disable default typing, per jpadams * re-enable generation of updates for docs having properties not in mappings protection against race condition is provided by harvest-time constraint to LT sweeper execution timestamp * support all ISO-formatted harvest timestamp strings dateutil is the official third-party library for parsing * correct erroneous log message * bugfix edge-cases * demote noisy log * deduplicate missing sweepers property logs * remove cruft * flesh out static type mappings * fix infinite loop when there are fewer hits than a full batch * comment out log_filepath * lint * add explanation
- Loading branch information
1 parent
4c3812d
commit bd99a08
Showing
9 changed files
with
432 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Empty file.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
REINDEXER_FLAG_METADATA_KEY = "ops:Provenance/ops:reindexed_at" |
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters