-
-
Notifications
You must be signed in to change notification settings - Fork 359
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use Overture POIs as a shop data source #5199
Comments
Hm, I have a few reservations against that, I tend towards closing this as will-not-fix:
Of course, the last point could also be said for any quest as it would probably be more efficient to record all the things with EveryDoor et al. 🤷♀️ |
With that said, if you don't think this would be helpful, feel free to close it. |
@tsmock you might be interested in SCEE ("expert edition" fork of StreetComplete) which has You would still need to deduplicate Overpass/OSM (optional; don't need to do that if you prefer extra manual verification work over writing deduplication code) & convert Overpass data to that If you do decide to try that; do share your experiences (and converting scripts) afterward; other people might be interested in that too. |
@tsmock In which areas Overture data would overall save time on inputting data? In areas I checked and know data quality is low enough that repairing data would take more time than mapping from scratch. It seems, at most, to be useful as detector of undermapped areas. Or as data source if you prefer its deficiencies over OSM data issues with coverage. |
Also, has anyone even managed to decently deduplicate OSM and Overture data? Looking at it I have no idea how to do it reliably given quality of data there. |
@tsmock such a quest could be added in SCEE. But having a way to communicate back to the server when something was added or not found at the indicated location is definitely necessary. There was a previous request of doing something similar using Osmose fixables, but they can only be downloaded for individual issues. |
https://bdon.github.io/overture-tiles/places.html#0.92/0/0 can be used to judge data quality in my area at least (Kraków, Poland) data quality is laughably bad |
Wow... there really is room for improvement. |
I selected a confidence of >= 0.6 and checked some POIs on a random location near me that I personally know (Hamburg).
At confidence >= 0.9, only a few places remain:
(The OSM data has 43 POIs at the same map excerpt of which all are currently correct) |
Are those numbers for POIs which exist in Overture but are missing from OSM? |
No, see my last sentencte. Of the map excerpt I looked at, all correct POIs of Overture also existed in OSM. Such a random example in a well-mapped neighbourhood is of course not representative. |
I did the same thing for Main Street near where I live, although I only looked at 3 blocks on the south side. I've gone up and down it several times with SC (mostly north side), but I almost never add new businesses via SC. I've done it once, but I didn't enjoy the experience. I am much more likely to take a picture and make a note for later mapping.
So ~2/3 (23/38) of the POIs in my test area are "good" before filtering based off of the
I'll note that I'm currently planning on making the POIs available in JOSM (through the MapWithAI plugin), and I'll probably be doing some client-side conflation and translation. For conflation, I think doing the translation first and then conflating based off of primary tags (primarily the name) will work the best. |
So anyway, I think I will close this here but feel free to continue commenting. If this is implemented, it may make more sense to put this into SCEE, at least unless the concerns I posted are to a degree disproven in actual use. |
I guess one could see if in the nearby radius there is either a POI with exact same name, or POI with similar name and same type, and remove such Overture POI as (possible) duplicate (note that luckily one doesn't have to very precise here: wrong guess won't result in wrong data in OSM, just in reducing potential benefit slightly). I have doubts whether importing from Overture dataset is good idea though (from business perspective helping Overture which might negatively impact OpenStreetMap, as well as potential legal contamination with Overture data - ODbL notwithstanding) |
The problem is that names are very often divergent (only some due top garbage data - sometimes place has interpretable name). And categorization in Overture extremely often utterly mismatches reality. |
I agree, but that is kind of a point of this idea that such ambiguous Overture POIs should be verified by on-the-ground mapper (instead of just doing the regular data import with some conflator, which we could do if the data were high-quality instead) But as said, I do not see support for such dataset as priority (I'd rank it little below |
my point is that it is laborious enough to filter and verify to be not useful at all (at this point, in areas I tested it has pretty low quality) |
I find the slow part of entering POIs to be all the typing. If I have to do that anyways because Overture names need changing, it doesn't seem worth it, even aside from all the other quality issues. |
General
Affected tag(s) to be modified/added: Depends upon the business. We will need to have a mapping of overture poi type -> osm tags. Some people have already started work on that here: https://wiki.openstreetmap.org/wiki/Overture_Categories .
Question asked: Does this
<business type>
named<business name>
exist here?Checklist
Checklist for quest suggestions (see guidelines):
We would need a server for this. Or some kind of feedback API provided by the overture folks.
This is doable, but would need to have some kind of deduplication (e.g.,
searchNodes(bbox(point, expandByXMeters)).stream().filter(OsmPrimitive::isTagged).noneMatch(p -> p.containsTags(point.getTags())
). From what I understand, SC downloads an area now.Server-side deduplication would be better, or we can download smaller areas for the POIs (e.g., only a z17 tile instead of a z15 tile).
Depends upon who you ask; most people would say that POIs are kind of important.
Ideas for implementation
Element selection:
Metadata needed:
Proposed UI:
[pin on map]
Does this
<business type>
named<business name>
exist here?| Other Answers... | No | Yes |
Other Answers...
opening_hours=closed
)The text was updated successfully, but these errors were encountered: