-
-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support for comma delimited housenumber + street #29
Comments
Off the top of my head there are two ways to accomplish this:
|
One other approach would be to check for a |
missinglink
added a commit
that referenced
this issue
May 27, 2019
missinglink
added a commit
that referenced
this issue
May 27, 2019
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I've seen a few cases internationally where users insert a comma between every component of the address, I'm not sure if this is done manually or when joining cells in a spreadsheet.
This is actually great for most tokens because it helps us to avoid parsing ambiguities.
The issue is when used between the
housenumber
and thestreet
so the parser will fail for an address such as:
but pass for one where the first comma is not present:
The code responsible for this is the
TokenDistanceFilter
, which should be modified to ignore section boundaries when considering adjacency.The text was updated successfully, but these errors were encountered: