Add helper script to get list of most frequent values for a key from taginfo #192
The idea is that this script would help to define the values that need to be rendered,
finding the most frequent ones and avoiding explicit negatives and misspellings.
This commit just adds the script, but doesn't make actual use of it anywhere yet.
The idea follows discussions on #116 and #182.
@gravitystorm: when I said that 'rendering "all" values of a tag is often desirable', the quotes were important: we clearly do not want to render misspellings and "no" values. So we use a whitelist and avoid catch-alls.
But for some keys like 'shop' or 'tunnel', that whitelist can become large and unwieldy. This script just makes creating and maintaining those whitelists easier, based on the observation that cutting off the long tail and dropping some frequent-but-unwanted values yields a pretty nice list.
Of course, the script's output must be reviewed. But eventually, instead of storing the actual output, we could just store the parameters (key, threshold, blacklist). That's to be done on a case-by-case basis.
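To make the (threshold, blacklist) idea concrete, here is a minimal sketch of the filtering step only. It is not the script from this PR: the function name `frequent_values`, the dict-of-counts input shape, and the sample counts are all made up for illustration and are not real taginfo figures.

```python
def frequent_values(counts, threshold, blacklist=()):
    """Return values with at least `threshold` uses, most frequent first.

    `counts` maps a tag value to its usage count (hypothetical input shape).
    Values listed in `blacklist` are dropped even if they are frequent.
    """
    blocked = set(blacklist)
    kept = [
        (value, count)
        for value, count in counts.items()
        if count >= threshold and value not in blocked
    ]
    # Sort by descending count; break ties alphabetically for stable output.
    kept.sort(key=lambda item: (-item[1], item[0]))
    return [value for value, _ in kept]


# Invented counts for the 'shop' key, for illustration only.
sample = {
    "supermarket": 120000,
    "convenience": 90000,
    "bakery": 60000,
    "yes": 7000,        # frequent but unwanted
    "no": 5000,         # explicit negative
    "supermaket": 12,   # misspelling in the long tail
}

whitelist = frequent_values(sample, threshold=1000, blacklist=["yes", "no"])
print(whitelist)
```

With these made-up numbers, the threshold removes the misspelling and the blacklist removes "yes" and "no", so only the three real shop types remain. Storing just the key, threshold and blacklist would then be enough to regenerate the list.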