-
-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
inconsistent behavior: accents in search #6815
Comments
I'll start looking into this |
I can't replicate ("Lefevre" does not match "Lef{`e}vre, V." for me). |
I tried both ISO-8859-1 and UTF-8. Language is English, library mode is BibTeX.
I double checked the file type of all files: it's "BibTeX text file, ASCII text" according to the file command. |
@sbecuwe I don't know if you'd be willing to share the original BibTeX file with me (I am NOT a JabRef developer, I am a volunteer). It would be easier to try to pinpoint what is going on using the debugger rather than going through the code. Also, I appreciate that you seem to be spending more time on this issue than you were probably intending to =/ In a nutshell,
|
Related to the whole messy complex of the Latex-To-Unicode converter (#6155). |
|
Hi, may I have a try on this issue? Is it possible to get some guidance on where I should start? Or is there any particular information I should look into? Thanks a lot! |
It is an issue where one needs to think hard what to do. One also needs to craft test cases. Hints on the current code behavior:
For a quick fix, other equivalence checks should be implemented. Java's Collator is a good start. Maybe, you can create a pull request:
|
Thank you so much for your detailed instructions. It really helps a lot! The code uses the contains methods for the String and I didn't find the Collator API has that method, so I searched online, and find Normalizer. It seems that the search performed as expected, but I am not very sure whether it has any underlying issues. I have created a PR for that. Please let me know if this is not a suitable API to use. I will fix it. Thanks again! |
Should be fixed by #8640 thanks to @LingZhang22 . @sbecuwe Could you please test the current dev version (after a backup of your files) if this issue persists? Thanks! |
JabRef version 5.1 on macOS 10.14.6
I have tested the latest development version from http://builds.jabref.org/master/ and the problem persists.
It seems the way accents are handled in author names, is not consistent.
"Lef{\`e}vre, V." is found when entering "Lefevre" in the search field. The same holds for "Pr{\'e}vost, M." when entering "Prevost".
However, "M{\"u}hlbach, G." is not found when entering "Muhlbach" in the search field. The same holds for "D{\'{\i}}az, J." when entering "Diaz", and "Nj{\aa}stad, O." when entering "Njastad" or "Njaastad".
The text was updated successfully, but these errors were encountered: