Initial start on auto-generating taginfo JSON #2754

peternewman · 2021-04-16T14:17:55Z

I'm not expecting any serious code reviews yet, more just sort of getting it out there a bit, and I guess starting to look at workflows.

TODO:

Combine the function names (e.g. add versus updateCheckDateForKey) - "Whether add or modify doesn't really matter, if it tags it, it tags it."

westnordost · 2021-04-16T21:17:05Z

Good start! Though, I can't say much about this yet because I haven't looked closer at how the file would need to look like for taginfo to work with it properly.

I'd just say that this should go into an own script.

I wonder if it makes more sense to generate this file once every release or as a hook like @FloEdelmann 's wiki-script. If the former, the proper location would be buildSrc/src/main/java/TaginfoJasonTask.kt and then register a task in the project's build.gradle.kts

The old taginfo json from @goldfndr listed the "quest fill" and "quest filter" separately. For a new version, I think it is better to only keep the "quest fill" (=how the app tags something) part, because obviously, for the filter, the app sometimes interprets deprecated, outdated, ambiguous, not-really-used and "wrong" tags, just so that the quest is not shown for too many elements. These kind of tags should then not appear in taginfo as "StreetComplete uses these tags" because this paints a false picture of the situation:

It may make it look like StreetComplete is propagating the use of this tag
It may look like a tag is "accepted and used" because StreetComplete uses it

matkoniecz · 2021-04-16T21:25:53Z

If you reach stage when it starts generating things and you are unaware about bugs... You can publish output as gist and I can look at some quests that I know well. Maybe I will spot some bugs this way.

And it is really cool that you are doing this!

peternewman · 2021-04-16T23:29:04Z

Good start! Though, I can't say much about this yet because I haven't looked closer at how the file would need to look like for taginfo to work with it properly.

Thanks. It's just the same as the existing one, I was initially trying to just automate what @goldfndr had done manually. Most of the fields are just free text, so obviously his hand-crafted explanations are likely to be better, but only if it's kept up to date all the time, hopefully we can be good enough via some automation:
https://wiki.openstreetmap.org/wiki/Taginfo/Projects

I'd just say that this should go into an own script.

Yes, although it shares lots of common requirements with @FloEdelmann 's code, such as extracting quest names and lists of quests, so unless we're going to have a helper library too... I was going to add some command line arguments (if Kotlin does them, so you would decide which output to produce).

I wonder if it makes more sense to generate this file once every release or as a hook like @FloEdelmann 's wiki-script. If the former, the proper location would be buildSrc/src/main/java/TaginfoJasonTask.kt and then register a task in the project's build.gradle.kts

Possibly, we can automate the generation like @FloEdelmann 's so it just gets built automagically each time there is a release, the benefit then is you don't need to do anything. I guess with both there's an argument that it's beneficial to have the data available sooner (i.e. as soon as a quest is released), either because people might be running their own builds of the code, either forks or nightly releases or whatever, and also because as soon as its been committed it means it's going to appear at some point soon, so in the wiki case it would mean the page could be updated before it's released, so it's immediately current when people checkout the new version, and in the Taginfo case, an accessible map designer for example might choose to prioritise adding a feature to their app knowing SC supports it and therefore data will exist and grow. Essentially once the code has been written, it's going to appear so having a little head start can't hurt.

The old taginfo json from @goldfndr listed the "quest fill" and "quest filter" separately. For a new version, I think it is better to only keep the "quest fill" (=how the app tags something) part, because obviously, for the filter, the app sometimes interprets deprecated, outdated, ambiguous, not-really-used and "wrong" tags, just so that the quest is not shown for too many elements. These kind of tags should then not appear in taginfo as "StreetComplete uses these tags" because this paints a false picture of the situation:

I guess it depends what people are doing with the data. Avoiding the query part is certainly likely to make the code far simpler, as I don't have to parse and understand the Overpass type stuff, just some basic Kotlin methods. Presumably it's mostly irrelevant as for the majority of current tags and ones we're "filling", we're likely to be filtering on them too, to ensure they aren't already set. Are there actually many which we only filter on?

* It may make it look like StreetComplete is propagating the use of this tag

* It may look like a tag is "accepted and used" because StreetComplete uses it

Yep, understood. Although we could add some descriptions about those usages.

peternewman · 2021-04-16T23:32:22Z

If you reach stage when it starts generating things and you are unaware about bugs... You can publish output as gist and I can look at some quests that I know well. Maybe I will spot some bugs this way.

Still on my list is to actually write the JSON file and commit it to the repo, it would be in the runner output, but the runner doesn't currently run apart from on a release due to the workflow config. I might tweak that to aid development. Thanks for the offer anyway. As long as a quest isn't setting things in multiple different ways (which I think a few are), it's likely most bugs will be fairly obvious as it will complain it can't process the data.

And it is really cool that you are doing this!

Thanks. I thought it seemed like an interesting little challenge with a useful outcome, without me having to get into proper Android and GUI development! I'm still a bit worried it might end up rather brittle and fragile, but probably less hassle than using a full Kotlin lexer to do it properly!

matkoniecz · 2021-04-17T04:31:22Z

Are there actually many which we only filter on?

From just implemented https://github.com/streetcomplete/StreetComplete/pull/2753/files

sets barrier (several values), filters on barrier=yes and:

         and !man_made
         and !historic
         and !military
         and !power
         and !tourism
         and !attraction
         and !amenity
         and !leisure

(as there are rare tagging mistakes of leisure=playground barrier=yes type that could result in mapper selecting "not existing" option for question about specifying barrier type. This filtering should reduce this anyway rare cases to something basically not happening)

Or building levels quest with

!building:levels and !height and !building:height
         and !man_made and location != underground and ruins != yes

filter (though here only building:height is a bad value, other would be kind of useful to list)

I was going to add some command line arguments (if Kotlin does them, so you would decide which output to produce).

It may be preferable to have only actually used code (as unused code triggered by arguments is by definition not used but still requires maintenance/review etc)

probably less hassle than using a full Kotlin lexer to do it properly!

Though after some times of doing this I am nowadays really preferring starting from a proper parser if at all available :)

westnordost · 2021-06-06T20:04:46Z

Are you still working on this?

…lete into data-actions

peternewman · 2021-06-07T14:15:05Z

Are there actually many which we only filter on?

(as there are rare tagging mistakes of leisure=playground barrier=yes type that could result in mapper selecting "not existing" option for question about specifying barrier type. This filtering should reduce this anyway rare cases to something basically not happening)

Hmm, as @westnordost mentions, it's how we flag those things without making it look like they are good/relevant tags, not errors.

filter (though here only building:height is a bad value, other would be kind of useful to list)

I think the other thing is parsing the query stuff is probably even harder than the changes stuff.

I was going to add some command line arguments (if Kotlin does them, so you would decide which output to produce).

It may be preferable to have only actually used code (as unused code triggered by arguments is by definition not used but still requires maintenance/review etc)

It is used code, some bits are common to the wiki stuff and the JSON, some are only used by one or the other. It should perhaps be in a separate library file if that's not excessive for a little script. Take a look now and it should make more sense.

probably less hassle than using a full Kotlin lexer to do it properly!

Though after some times of doing this I am nowadays really preferring starting from a proper parser if at all available :)

Yeah, I'm still a bit undecided, it seems to be mostly working. Some bits would be a bit easier with a lexer/parser, but I'm not sure it will magically fix the issues, as I'll still need to process all the tokens it returns. In C++ land I'd probably do something clever with some macros and ifdefs so you could compile the code in two different ways to be able to get a list of changes it would apply or something.

…lete into data-actions

peternewman · 2021-06-07T14:39:28Z

Are you still working on this?

I had mostly forgotten, or been doing other things.

I've now picked it up and it covers most stuff I think. There are a few fiddly edge cases, but I wonder if it's worth merging as is/after a small tidy up, and perhaps having both JSON files in parallel. E.g. @goldfndr manual one still covers all the query stuff, which this doesn't touch at all yet, but equally this has already picked up say the bollard and CCTV camera quests with no input from anyone, so it's doing useful things already. I'm pretty confident it's not lying, and while it's incomplete in some ways, so is the existing one. I believe TagInfo is quite happy with having multiple JSON files for the same project, so there's no major downside there.

peternewman · 2021-06-23T10:16:56Z

Remember to update https://github.com/streetcomplete/StreetComplete/blob/master/CONTRIBUTING.md#improving-documentation when this is in!

Edit: currently removed in 3cd0e73 to be tidied as appropriate depending on how this PR progresses.

…ce it entirely

Resync with master

…lete into data-actions

matkoniecz · 2021-12-13T13:13:23Z

@peternewman Are you still working on this one? Or is it waiting for someone to adopt it?

matkoniecz · 2021-12-13T13:19:07Z

.github/generate-quest-metadata.main.kts

+    when (task) {
+      "wiki" -> generateWikiCsv(questNames, questFiles)
+      "json" -> generateTaginfoJson(questNames, questFiles)
+      else -> { // Note the block


What this comment is referring to? { }?

Yeah. I think it's possibly because if I didn't put the block in, it didn't print or something?

matkoniecz · 2021-12-13T13:20:20Z

.github/generate-quest-metadata.main.kts

+        "https://github.com/streetcomplete/StreetComplete",
+        "https://wiki.openstreetmap.org/wiki/StreetComplete",
+        "https://raw.githubusercontent.com/streetcomplete/StreetComplete/master/app/src/main/res/mipmap-xhdpi/ic_launcher.png",
+        "Peter Newman"


Are you sure that you want to be contacted directly about maintaining this script?

Possibly not, a bit of a placeholder at least. I think the spec suggested it should be an individual... 🤷

matkoniecz · 2021-12-13T13:29:04Z

General question: would it be easier to parse Kotlin code into some AST? For example with https://github.com/kotlinx/ast

I did may share of "parsing is complicated, I will solve it with bunch of simple regexps". And after several months I wished that I did it properly with parsing - as I kept encountering one edge case after another and ended with monster regexp which was uneditable nightmare.

End result was dynamically build regexp that was incomprehensible large and really poor in editing (at least I had tests).

If that would be regexp - then it really needs some tests, otherwise (I know from experience) this will be irritating source of regressions if this will be ever changed.

smichel17 · 2021-12-13T17:37:03Z

would it be easier to parse Kotlin code into some AST?

I was going to suggest the same thing. At the very least, we already have code for parsing a string to an ElementFilterExpression (EFE); it shouldn't be too much work* to have an EFE spit out an equivalent tagInfo for it.

For more complicated parsing, perhaps we could take an approach something like a javadoc: embed a comment above the getApplicableElements function with the relevant information. That would still require manual maintenance, but at least it would be right in your face when you're changing the code, rather than off in a separate repo.

*famous last words, I know

…lete into data-actions

peternewman · 2022-01-28T15:41:37Z

#3202 was caught, but as it was nothing fatal and obnoxious to fix the relevant scripts were commented out - in hope that Sophox situation will be fixed on Sophox side

Yeah, it just avoids that mad rush finding loads of stuff is broken just as you're about to release.

Should we really combine all those (e.g. add versus updateCheckDateForKey)?

Yes, I think yes. Whether add or modify doesn't really matter, if it tags it, it tags it.

Okay, I'll add to the TODO list, it's vaguely useful during debug to know where it's come from.

Regarding check date, it should probably be interpreted like what it tags. F.e. the taginfo JSON should show that the opening hours quest tags opening_hours but also check_date:opening_hours.

I think it already did this before you commented, it definitely does now anyway.

@peternewman Are you still working on this one? Or is it waiting for someone to adopt it?

On and off, although mostly off unfortunately, too many other bits to do too. I wouldn't hugely object if someone wanted to take it on, I think I've done most of the easy bits! I've just updated it all to cope with all the changes due to #3665 .

General question: would it be easier to parse Kotlin code into some AST? For example with https://github.com/kotlinx/ast

From a quick look at that, I'm not sure it actually solves the hard part of this, which is dealing with the logic and enums etc. It would avoid the regex, and some potential errors there, but probably in exchange for more complicated/less readable means of processing the results

e.g. given this code:

StreetComplete/app/src/main/java/de/westnordost/streetcomplete/quests/cycleway/AddCycleway.kt

Lines 196 to 216 in 9c6d3e2

    
           private enum class Side(val value: String) { 
        
               LEFT("left"), RIGHT("right"), BOTH("both") 
        
           } 
        
           private fun applyCyclewayAnswerTo(cycleway: Cycleway, side: Side, dir: Int, tags: Tags) 
        
           { 
        
               val directionValue = when { 
        
                   dir > 0 -> "yes" 
        
                   dir < 0 -> "-1" 
        
                   else -> null 
        
               } 
        
               val cyclewayKey = "cycleway:" + side.value 
        
               when (cycleway) { 
        
                   NONE, NONE_NO_ONEWAY -> { 
        
                       tags[cyclewayKey] = "no" 
        
                   } 
        
                   EXCLUSIVE_LANE, ADVISORY_LANE, UNSPECIFIED_LANE -> { 
        
                       tags[cyclewayKey] = "lane" 
        
                       if (directionValue != null) { 
        
                           tags["$cyclewayKey:oneway"] = directionValue

We need to produce the JSON version of these statements; that to my mind is the hard bit:

tags["left"] = "no"
tags["left"] = "lane"
tags["left:oneway"] = "yes"
tags["left:oneway"] = "-1"
tags["right"] = "no"
tags["right"] = "lane"
tags["right:oneway"] = "yes"
tags["right:oneway"] = "-1"
tags["both"] = "no"
tags["both"] = "lane"
tags["both:oneway"] = "yes"
tags["both:oneway"] = "-1"

As I mentioned, I could possibly conceive you might be able to do some preprocessor magic in C/C++, I don't know if Kotlin offers equivalents, or if it would even help.

Maybe if everything could be switched to code similar to these (which in fairness the Cycleway is already):

StreetComplete/app/src/main/java/de/westnordost/streetcomplete/quests/barrier_type/BarrierType.kt

Lines 31 to 48 in 9c6d3e2

    
           fun BarrierType.applyTo(tags: Tags) { 
        
               tags["barrier"] = this.osmValue 
        
               when (this) { 
        
                   BarrierType.STILE_SQUEEZER -> { 
        
                       tags["stile"] = "squeezer" 
        
                   } 
        
                   BarrierType.STILE_LADDER -> { 
        
                       tags["stile"] = "ladder" 
        
                   } 
        
                   BarrierType.STILE_STEPOVER_WOODEN -> { 
        
                       tags["stile"] = "stepover" 
        
                       tags["material"] = "wood" 
        
                   } 
        
                   BarrierType.STILE_STEPOVER_STONE -> { 
        
                       tags["stile"] = "stepover" 
        
                       tags["material"] = "stone" 
        
                   } 
        
               }

Then we'd just need to write code which feeds in all combinations of inputs (i.e. Cycleway x Side x dir) and captures all the output; but then in this case, dir is currently an Int so has thousands of values; we'd need to do the Int to enum conversion first to make it manageable.

I did may share of "parsing is complicated, I will solve it with bunch of simple regexps". And after several months I wished that I did it properly with parsing - as I kept encountering one edge case after another and ended with monster regexp which was uneditable nightmare.

Yeah I've found a few quirks here which I've caught and fixed, some of which is inconsistency in the codebase (e.g. func = func2 versus func { func2 } etc).

If that would be regexp - then it really needs some tests, otherwise (I know from experience) this will be irritating source of regressions if this will be ever changed.

Agreed, I've started collecting the examples so some tests can be written.

would it be easier to parse Kotlin code into some AST?

I was going to suggest the same thing. At the very least, we already have code for parsing a string to an ElementFilterExpression (EFE); it shouldn't be too much work* to have an EFE spit out an equivalent tagInfo for it.

Remember we're only interested in the tagging side, not the selection side (@westnordost preference, but I can see the logic so we don't flag e.g. roundabout=yes in our road quest or some other random/obscure tag). So I don't think any EFE stuff will help.

For more complicated parsing, perhaps we could take an approach something like a javadoc: embed a comment above the getApplicableElements function with the relevant information. That would still require manual maintenance, but at least it would be right in your face when you're changing the code, rather than off in a separate repo.

Again, we don't care about getApplicableElements just applyAnswerTo. I think @westnordost didn't want JSON in the source folders, we talked about some JSON outside of the main source. I'd imagine he may feel the same about Javadoc style commenting.

….toCheckDateString

westnordost · 2022-02-18T18:36:57Z

This has been open as draft since almost a year. Is this still being worked on?

westnordost · 2022-04-10T11:10:33Z

Closing as this seems to be abandoned.

westnordost · 2022-04-10T11:11:18Z

If you intend to finish it, please only open a new PR when it is ready for review or if you need help (then as draft).

peternewman added 2 commits April 15, 2021 11:30

Very initial start on auto-generating taginfo JSON

f74eb5b

Initial basic MVP generating taginfo JSON

685f8f5

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

50fbab7

…lete into data-actions

This was referenced Jun 6, 2021

add camera type quest #2856

Merged

Fix a suspected Achievements bug #2952

Merged

peternewman added 6 commits June 7, 2021 00:03

Standardise the name of a file to match its friends

7df21b1

Handle medium complexity enums and fix some other behaviour

bec5e7f

Deal with enums and lots of other edge cases

d2f525f

Ignore the taginfo JSON for now

1588c69

Actually generate the TagInfo JSON, add a second workflow

6e5c35b

Rename the kotlin file now it's doing multiple jobs

13e6d1d

peternewman added 3 commits June 7, 2021 15:18

Always run the workflow during development of it

f26b04b

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

8fdcf0c

…lete into data-actions

Skip keys with unprocessed string substitutions

0b9458b

peternewman added 2 commits June 7, 2021 15:46

Make some debugging more descriptive

14bb9f4

Handle the has based changes and updateCheckDateForKey

deb508a

peternewman changed the title ~~Initial basic start on auto-generating taginfo JSON~~ Initial start on auto-generating taginfo JSON Jun 23, 2021

Delete the note about the old manual version assuming this will repla…

3cd0e73

…ce it entirely

peternewman mentioned this pull request Jun 23, 2021

Improve some wording of contributing.md #2994

Merged

peternewman added 2 commits August 18, 2021 02:36

Try and allow a PR to merge

f9b5788

Merge pull request #4 from streetcomplete/master

15c0c5a

Resync with master

peternewman added 2 commits September 9, 2021 14:50

Filter out some of the types of changes we bother to report to taginfo

623fabf

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

6588808

…lete into data-actions

peternewman mentioned this pull request Oct 26, 2021

Handle constants used in quests, e.g. Crossing Sound bryceco/GoMap#572

Merged

westnordost mentioned this pull request Nov 18, 2021

Remove sidewalk(:*)=none instances goldfndr/StreetCompleteJSON#19

Open

matkoniecz reviewed Dec 13, 2021

View reviewed changes

smichel17 mentioned this pull request Dec 13, 2021

stop encouraging to edit a dead repository #3589

Merged

peternewman added 11 commits January 27, 2022 01:58

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

f99b592

…lete into data-actions

Update to keep up with changes in streetcomplete#3665

0f2bb15

More updates to keep up with changes in streetcomplete#3665

b32a5eb

Fix up some more bits which had been missed

b293cb4

Keep one delete style which is currently used

7dc4d70

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

75ac156

…lete into data-actions

Remove the delete again now streetcomplete#3682 is in

e7135f0

Update getQuestAnswerType now streetcomplete#3682 is in

ed46e1e

Handle alias names for enums

43a1f8a

Fix a load more issues and improve our parsing

e3f1d93

Merge branch 'master' of https://github.com/streetcomplete/StreetComp…

2b51b09

…lete into data-actions

peternewman added 2 commits January 28, 2022 15:43

Fix the broken merge

70ec8be

Handle main part of Orchard and Sport quests with joinToString. Also …

a388043

….toCheckDateString

peternewman mentioned this pull request Feb 9, 2022

Make the questName regex more robust #3739

Closed

westnordost closed this Apr 10, 2022

matkoniecz mentioned this pull request Jul 20, 2022

Taginfo listing of tags added or modified by StreetComplete #4225

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial start on auto-generating taginfo JSON #2754

Initial start on auto-generating taginfo JSON #2754

peternewman commented Apr 16, 2021 •

edited

Loading

westnordost commented Apr 16, 2021

matkoniecz commented Apr 16, 2021

peternewman commented Apr 16, 2021

peternewman commented Apr 16, 2021

matkoniecz commented Apr 17, 2021 •

edited

Loading

westnordost commented Jun 6, 2021

peternewman commented Jun 7, 2021

peternewman commented Jun 7, 2021

peternewman commented Jun 23, 2021 •

edited

Loading

matkoniecz commented Dec 13, 2021

matkoniecz Dec 13, 2021

peternewman Jan 28, 2022

matkoniecz Dec 13, 2021

peternewman Jan 28, 2022

matkoniecz commented Dec 13, 2021 •

edited

Loading

smichel17 commented Dec 13, 2021

peternewman commented Jan 28, 2022

westnordost commented Feb 18, 2022

westnordost commented Apr 10, 2022

westnordost commented Apr 10, 2022 •

edited

Loading

Initial start on auto-generating taginfo JSON #2754

Initial start on auto-generating taginfo JSON #2754

Conversation

peternewman commented Apr 16, 2021 • edited Loading

westnordost commented Apr 16, 2021

matkoniecz commented Apr 16, 2021

peternewman commented Apr 16, 2021

peternewman commented Apr 16, 2021

matkoniecz commented Apr 17, 2021 • edited Loading

westnordost commented Jun 6, 2021

peternewman commented Jun 7, 2021

peternewman commented Jun 7, 2021

peternewman commented Jun 23, 2021 • edited Loading

matkoniecz commented Dec 13, 2021

matkoniecz Dec 13, 2021

Choose a reason for hiding this comment

peternewman Jan 28, 2022

Choose a reason for hiding this comment

matkoniecz Dec 13, 2021

Choose a reason for hiding this comment

peternewman Jan 28, 2022

Choose a reason for hiding this comment

matkoniecz commented Dec 13, 2021 • edited Loading

smichel17 commented Dec 13, 2021

peternewman commented Jan 28, 2022

westnordost commented Feb 18, 2022

westnordost commented Apr 10, 2022

westnordost commented Apr 10, 2022 • edited Loading

peternewman commented Apr 16, 2021 •

edited

Loading

matkoniecz commented Apr 17, 2021 •

edited

Loading

peternewman commented Jun 23, 2021 •

edited

Loading

matkoniecz commented Dec 13, 2021 •

edited

Loading

westnordost commented Apr 10, 2022 •

edited

Loading