-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tika: image parser fails in JVM mode with java.lang.LinkageError #8375
Comments
@JiriOndrusek Thanks, unfortunately not all parsers are native ready, or more precisely, probably many of them are, but since there are many of them, I don't know for sure, I'll need to commit to updating the text. And we need to keep fixing the native issues related to specific parsers. I know OOXML parsers have problem, Image parser is another one. |
There are 69 parsers :-) and a huge number of supported formats. |
@stuartwdouglas Hi Stuart, do you have ideas why would @JiriOndrusek Can you give me a favour please and try the same reproducer but without configuring anything in |
This happens if a deployment contains part of the XML parser API but not all of it. Some elements may be loaded from the system class loader, and others from QuarkusClassLoader. |
@stuartwdouglas tnanks; I wonder if |
Hi @sberyozkin , To be sure, I've tested it on quarkus quickstart: https://github.com/quarkusio/quarkus-quickstarts/tree/master/tika-quickstart
I suspect that 1+ parsers which are by default used (among these 60+ parsers) are not compatible with native quarkus. (That is the reason, I've used application.properties to select parsers which will be used)I've tried xml and office parsers - they are working correctly. then I've used the image parser and it fails (so it's possible that imageParser is one of the broken ones, but there could be more of them) |
Hi @JiriOndrusek np at all and thanks for the interesting feedback.
Which parsers are these ? See #6549 to do with OOXML parsers failing in native. What may also be happening is that some code path in some parser is only activated if a doc is more complex etc Thanks |
@sberyozkin
|
@JiriOndrusek Thanks, OfficeParser is based on poi so #6549 may be specific to some specific |
Any progress? |
Issue could be closed. I rerun the original test (disabled because of this error) in camel-quarkus and the test works in JVM and native. Because this issue is almost 3 years old, I suppose that some change affected it in positive way. |
Describe the bug
According to the documentation of quarkus tika, almost all parsers should be native ready if they are correctly registered.
There are several excluded ones (see https://github.com/quarkusio/quarkus/blob/master/extensions/tika/deployment/src/main/java/io/quarkus/tika/deployment/TikaProcessor.java#L39)
Previous text states that eg. imageParser should work correctly
I've tried to use imageParser in native, but it was failing even in JVM.
Expected behavior
Should work in JVM and native mode.
To Reproduce
Steps to reproduce the behavior:
I've created small reproducer
Error log
The text was updated successfully, but these errors were encountered: