Add search by metadata link to home page #1034

Closed
maxsibilla opened this issue Nov 14, 2023 · 8 comments · Fixed by #1039 or #1046
maxsibilla commented Nov 14, 2023

We need to figure out the best way to add a link to https://data.sennetconsortium.org/discover/metadata on the home page. The navbar is getting cluttered, so we may need to restructure it or add a second one. Another option might be to have a section, shown only on the home page, that links out to the various search pages: metadata and files.

maxsibilla commented Nov 14, 2023

tjmadonna commented Nov 14, 2023

@maxsibilla The All Block Samples request returns a 503 response from the search-api. The error message is:

Trying to create too many buckets. Must be less than or equal to: [65535] but was [81065]. This limit can be set by changing the [search.max_buckets] cluster level setting.
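
For logging or triage, the limit and the actual bucket count can be pulled out of a message like the one above. This is a minimal sketch that assumes the message keeps this exact wording; `parse_bucket_error` is a hypothetical helper, not part of the search-api.

```python
import re

# Error text copied from the comment above.
ERROR = (
    "Trying to create too many buckets. Must be less than or equal to: "
    "[65535] but was [81065]. This limit can be set by changing the "
    "[search.max_buckets] cluster level setting."
)


def parse_bucket_error(message):
    """Return (limit, actual) from a too-many-buckets message, or None."""
    match = re.search(
        r"less than or equal to: \[(\d+)\] but was \[(\d+)\]", message
    )
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


limit, actual = parse_bucket_error(ERROR)
print(f"limit={limit} actual={actual} over_by={actual - limit}")
```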

I think most of these buckets are being created by the creation_date and modification_data facets.

maxsibilla commented

@tjmadonna We might need to rethink how we request this data, then. We could simply increase max_buckets, but that might just kick the issue down the road. We should first identify why so many buckets are being created, and then decide how to modify the request.

tjmadonna commented

@maxsibilla After looking into this a little more, I see that an All Block Samples request could potentially contain three number range facets (with histograms). I think this might be the issue. I'll try to confirm and fix that tomorrow.

maxsibilla commented

@tjmadonna In that case, we might just need to show those facets on request (as opposed to by default).
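
One way to read "on request" is to add a histogram aggregation to the search body only for facets the user has opened. The sketch below assumes an Elasticsearch-style `aggs` body. The field names are taken from later comments in this thread; the `HISTOGRAM_FACETS` table, its interval values, and `build_aggregations` are hypothetical and only illustrate the idea.

```python
# Hypothetical facet table: field name -> histogram settings.
# Interval values are illustrative, not the project's real configuration.
HISTOGRAM_FACETS = {
    "metadata.volume_value": {"interval": 1},
    "metadata.weight_value": {"interval": 1},
    "metadata.pathology_distance": {"interval": 1},
}


def build_aggregations(requested_facets):
    """Return an "aggs" body containing only the histogram facets asked for."""
    aggs = {}
    for field in requested_facets:
        config = HISTOGRAM_FACETS.get(field)
        if config is None:
            continue  # not a histogram facet we know about
        aggs[field] = {
            "histogram": {"field": field, "interval": config["interval"]}
        }
    return aggs


# Default page load: no histogram facets are requested, so none are computed.
print(build_aggregations([]))
# The user expands the volume facet: only that histogram is added.
print(build_aggregations(["metadata.volume_value"]))
```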

tjmadonna commented

This is being caused by the metadata.volume_value histogram aggregation. The production data has a maximum value of 80850, and we are setting an interval size of 1.

min = 9000
max = 80850.0
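
The arithmetic behind this can be sketched as follows, assuming the min and max above belong to metadata.volume_value and that the histogram fills in empty buckets across the whole range. `histogram_bucket_count` and `interval_for` are hypothetical helpers, and the target of 50 buckets is only an illustration.

```python
import math

MAX_BUCKETS = 65535  # limit quoted in the error message


def histogram_bucket_count(min_value, max_value, interval):
    """Buckets a fixed-interval histogram spans from min to max, inclusive."""
    first = math.floor(min_value / interval)
    last = math.floor(max_value / interval)
    return last - first + 1


def interval_for(min_value, max_value, target_buckets):
    """Smallest whole-number interval that keeps the count within the target."""
    span = max_value - min_value
    interval = max(1, math.ceil(span / target_buckets))
    # Bucket boundaries are aligned to multiples of the interval, so the
    # first guess can be off by one bucket; widen until it fits.
    while histogram_bucket_count(min_value, max_value, interval) > target_buckets:
        interval += 1
    return interval


count = histogram_bucket_count(9000, 80850.0, 1)
print(count, count > MAX_BUCKETS)  # 71851 True

wider = interval_for(9000, 80850.0, 50)
print(wider, histogram_bucket_count(9000, 80850.0, wider))
```

With an interval of 1 this one facet alone spans more buckets than the limit allows, so any fix has to widen the interval, cap the range, or drop the histogram.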

maxsibilla commented

Yeah, that would do it. I guess the initial question is whether we need to show this and, if so, whether we should display it as a histogram.

tjmadonna commented

We're probably going to run into the same issue with metadata.weight_value and metadata.pathology_distance. Right now I don't believe we have any metadata with those values set.
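
One way to check the other fields ahead of time is to ask for their min and max with a `stats` aggregation and flag any field whose range would overflow the limit at the current interval. This is a sketch only: the aggregation names, `build_stats_request`, `risky_fields`, and the sample response are hypothetical, and the sample numbers simply reuse the values reported earlier in this thread.

```python
NUMERIC_FIELDS = [
    "metadata.volume_value",
    "metadata.weight_value",
    "metadata.pathology_distance",
]


def agg_name(field):
    """Hypothetical naming scheme for the per-field stats aggregation."""
    return field.replace(".", "_") + "_stats"


def build_stats_request(fields):
    """Search body that returns min/max per field and no hits."""
    return {
        "size": 0,
        "aggs": {agg_name(f): {"stats": {"field": f}} for f in fields},
    }


def risky_fields(stats_response, fields, interval=1, max_buckets=65535):
    """Fields whose histogram would span more than max_buckets buckets."""
    risky = []
    for field in fields:
        stats = stats_response["aggregations"][agg_name(field)]
        if stats["min"] is None or stats["max"] is None:
            continue  # no documents have this field set yet
        if (stats["max"] - stats["min"]) / interval + 1 > max_buckets:
            risky.append(field)
    return risky


# Illustrative response: only volume_value has data, as described above.
sample_response = {
    "aggregations": {
        "metadata_volume_value_stats": {"min": 9000.0, "max": 80850.0},
        "metadata_weight_value_stats": {"min": None, "max": None},
        "metadata_pathology_distance_stats": {"min": None, "max": None},
    }
}
print(risky_fields(sample_response, NUMERIC_FIELDS))
```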
