improve sorting when filtering datasets #2834

philippotto · 2018-06-29T13:03:05Z

I used the dice coefficient to sort the datasets by best match. This works quite well for the suffix example which covers probably 95% of the cases where this is needed. I'm a bit concerned about performance, since the sorting resulted in a noticable lag for 1000 datasets. However, my datasets all had almost the same name. So, if the search query filters the amount of datasets considerably, the lag should be reduced, as well. In the end, the lab should just give final feedback (after deployment, since the dev server doesn't have many datasets). For the copy&paste use case it should definitely be enough.

Mailable description of changes:

When using the search functionality in the datasets view, the datasets will be sorted so that the best match is shown first. If a different sorting is desired, the sorting-arrows in the columns can still be used to change the sorting criteria.

URL of deployed dev instance (used for testing):

https://dicesortdatasets.webknossos.xyz

Steps to test:

open dashboard (default sort order should be "created")
sort by a different criteria
type something in the search bar (blue sorting-arrow should turn gray since the datasets are sorted by best match now)
sort again by another criteria --> should work even after refining the search query

Issues:

fixes Improve dataset search/sorting #2818

Ready for review

daniel-wer

Nice, works well. I'm alright with deploying it for performance feedback.

daniel-wer · 2018-07-02T11:46:11Z

app/assets/javascripts/dashboard/advanced_dataset/advanced_dataset_view.js

+        ? _.chain(filteredDataSource)
+            .map(row => ({
+              row,
+              diceCoefficient: dice(row.name, this.props.searchQuery),


I'm not sure whether this will result in strange results if the user searches for a word in the description. The search results are filtered using the name and description properties, but the diceCoefficient only takes into account the dataset name for sorting.
Including the description in the diceCoefficient will probably slow it down further, right?

Actually I noticed the description is not even displayed in the advanced dataset view, so it's probably alright to remove "description" in line 66.

Searching for some word of the description doesn't work for me anyways, neither in the advanced nor on the spotlight dataset view, not sure why.

Searching for some word of the description doesn't work for me anyways, neither in the advanced nor on the spotlight dataset view, not sure why.

Really? It works for me 🤔

I'm not sure whether this will result in strange results if the user searches for a word in the description. The search results are filtered using the name and description properties, but the diceCoefficient only takes into account the dataset name for sorting.

My assumption is that the search is only used for dataset name matching, anyway. In the rare case, that a user searches the description, the ordering might be suboptimal (or "random"?), but I don't think that's a showstopper, as the user's had to scroll through a long list before anyway. We can remove the description from the search parameters, but I'd leave it as is for now until someone complains. That way, we don't remove existing functionality from the view.

I assumed that the string Original data and segmentation is the description and I could search for parts of that, but that string is only a default string that is shown if there is no description. Setting a custom description and searching for it works as intended, all fine :)
Let's leave the description in there, your argumentation makes sense to me!

improve sorting when filtering datasets

68df245

philippotto added enhancement frontend labels Jun 29, 2018

philippotto self-assigned this Jun 29, 2018

philippotto requested a review from daniel-wer June 29, 2018 13:03

update snapshots

5fc5123

daniel-wer approved these changes Jul 2, 2018

View reviewed changes

Merge branch 'master' into dice-sort-datasets

8f8d0ef

philippotto merged commit eb08d0f into master Jul 2, 2018

normanrz deleted the dice-sort-datasets branch July 2, 2018 17:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve sorting when filtering datasets #2834

improve sorting when filtering datasets #2834

philippotto commented Jun 29, 2018 •

edited

Loading

daniel-wer left a comment

daniel-wer Jul 2, 2018

daniel-wer Jul 2, 2018

daniel-wer Jul 2, 2018

philippotto Jul 2, 2018

daniel-wer Jul 2, 2018

improve sorting when filtering datasets #2834

improve sorting when filtering datasets #2834

Conversation

philippotto commented Jun 29, 2018 • edited Loading

Mailable description of changes:

URL of deployed dev instance (used for testing):

Steps to test:

Issues:

daniel-wer left a comment

Choose a reason for hiding this comment

daniel-wer Jul 2, 2018

Choose a reason for hiding this comment

daniel-wer Jul 2, 2018

Choose a reason for hiding this comment

daniel-wer Jul 2, 2018

Choose a reason for hiding this comment

philippotto Jul 2, 2018

Choose a reason for hiding this comment

daniel-wer Jul 2, 2018

Choose a reason for hiding this comment

philippotto commented Jun 29, 2018 •

edited

Loading