Add MathML support when importing PubMed #4273
Comments
I could confirm this in the latest master. The problem seems to be that our parser is not aware of the italic annotations in the XML.
I tried around a bit but can't get it working. I am not an XML expert, but somehow the math namespace can't be resolved. |
7 months ago and of low priority! I understand that you are under considerable stress, but medline import for some of us is of utmost importance. |
There seems to be a new one, and one needs a bunch of additional XML files: mathml-in-pubmed.mod
@bernhard-kleine Well, PubMed import is generally working, i.e. importing entries is possible. Only for the subset of entries that use italics in the title or abstract is the import not 100% correct. That's probably why it was labeled that way. As a side note, the E-utilities API seems to be able to export JSON as well, so this might be an alternative. |
Unfortunately this is still a problem. JAXB needs to handle the MathML somehow. |
This is still a problem (checked today with v5.3 portable on Windows 10). I did not check whether there was any trouble with the Firefox extension, though. I understand that the people maintaining JabRef have other priorities. I think this issue could be closed, but then it would be fair to deactivate the Medline / PubMed fetcher and explicitly mention that PubMed results can be imported into JabRef manually through an nbib file generated from the PubMed website ("Send to" > "Citation manager"). |
This is still a problem (checked today with v5.7 portable on Windows 10). Closing this issue, considering it as "wontfix" |
Sorry, I have recently been looking into this issue again. Unfortunately, I was not able to get the XML parsing to work for this case. However, I would still leave this open for the future. |
Hi @Siedlerchr, in this issue the StAX approach works for italics, underline and bold, and we can extract just the text by ignoring those tags. For the MathML tags, are we looking for a full MathML-to-LaTeX conversion? Or are we just interested in extracting the character elements within the tags? E.g. the following would convert to just its characters:
<mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML">
<mml:mrow>
<mml:msubsup>
<mml:mi>η</mml:mi>
<mml:mi>p</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math> |
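To make the "plain character" option concrete, here is a minimal sketch of how the text could be pulled out of such a fragment with StAX. It is not JabRef's actual importer code: the class name `MathMlTextSketch` and the method `extractText` are made up for illustration, and the sketch simply concatenates all non-whitespace character events while ignoring every tag.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper for illustration only; not part of JabRef.
public class MathMlTextSketch {

    /** Concatenates the character content of an XML fragment, dropping all tags. */
    public static String extractText(String xml) throws XMLStreamException {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // Merge adjacent character events so each text node arrives in one piece.
        factory.setProperty(XMLInputFactory.IS_COALESCING, true);
        // Do not process DTDs or external entities for this small fragment.
        factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
        factory.setProperty(XMLInputFactory.IS_SUPPORTING_EXTERNAL_ENTITIES, false);

        XMLStreamReader reader = factory.createXMLStreamReader(new StringReader(xml));
        StringBuilder text = new StringBuilder();
        try {
            while (reader.hasNext()) {
                int event = reader.next();
                // Keep only real text; skip the indentation between elements.
                if (event == XMLStreamConstants.CHARACTERS && !reader.isWhiteSpace()) {
                    text.append(reader.getText());
                }
            }
        } finally {
            reader.close();
        }
        return text.toString();
    }

    public static void main(String[] args) throws XMLStreamException {
        // Same structure as the example above; \u03B7 is the Greek letter eta.
        String mathMl =
                "<mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\">\n"
                + "  <mml:mrow>\n"
                + "    <mml:msubsup>\n"
                + "      <mml:mi>\u03B7</mml:mi>\n"
                + "      <mml:mi>p</mml:mi>\n"
                + "      <mml:mn>2</mml:mn>\n"
                + "    </mml:msubsup>\n"
                + "  </mml:mrow>\n"
                + "</mml:math>";
        System.out.println(extractText(mathMl));
    }
}
```

For the fragment above, `extractText` returns the three characters η, p and 2 run together. The sub/superscript structure of `msubsup` is lost, which is the trade-off of this approach compared with a full MathML-to-LaTeX conversion.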
That's great to hear! For MathML, I'm not sure. I found an XSLT transform script for MathML; maybe that helps or can be used somehow. Otherwise I would really go with the plain character parsing as in your example. |
@vchouraki The italics issue was solved. Please check the latest build in https://builds.jabref.org/main/. I renamed this issue, because we are discussing MathML now. - I hope the MathML support won't be a rabbit hole (see #6155). |
Please assign this issue to me |
Hello,
Tasks
JabRef version 4.3.1 on Ubuntu 18.04 with jre1.8.0_181 downloaded from oracle website
Steps to reproduce:
Example using above "search"
Original title :
Imported title :
Original abstract :
Imported abstract :
Best,
Vincent