-
Notifications
You must be signed in to change notification settings - Fork 186
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test failures #149
Comments
3 will likely be OS related. I get 1/2/4 locally (along with the drawingtotext error) myself. Hoping to spend time today churning through some tickets. Will get tests going first, hopefully is fairly straight forward. |
Pushed update that addresses the other breakages. The Going to close this issue out, but can keep discussing. If you figure something out I'm happy to update the docs for other folks that might run into it. |
Just updated, and 1/2/4 have now gone, but in their place I get three rtf complaints about:
lib/extractors/html.js:76 called from rtf.js:34. All three failing tests are in the same |
hmm, no test failures locally, but think I may know why, I'll dig in later today |
PR above fixed this |
On Linux Mint 18, I forked the project, did a git clone, then
yarn install
. I also did these:Running
yarn test
(npm test
is identical, by the way). I get "5 of 177 tests failed". One is because I haven't installed drawingtotext. Here are the others:textract for .pdf files will properly handle multiple columns:
AssertionError: expected false to be true
textract for .pdf files can handle files with spaces in the name:
AssertionError: expected false to be true
textract for image files will extract text from GIF files:
AssertionError: expected [Error: Error extracting [[ testphoto.gif ]], exec error: Command failed: tesseract /home/darren/Projects/textract/test/files/testphoto.gif /tmp/textract/testphoto quiet
Tesseract Open Source OCR Engine v3.04.01 with Leptonica
Warning in pixReadMemGif: writing to a temp file, not directly to memory
Error in pixReadStreamGif: Can't use giflib-5.1.2; suggest 5.1.1 or earlier
Error in pixReadStream: gif: no pix returned
Error in pixRead: pix not read
Error in pixReadMemGif: pix not read
Error in pixReadMem: gif: no pix returned
Error during processing.
] to be null
fromUrl tests will markdown files:
actual expected
""# This is an h1 ## This is an h2 This__This text has been bolded and italicizeditalicized__ "
(The last one is hard to read without the colour-coding! Basically it is saying the
#
are still there and the underlines are still in there.)The text was updated successfully, but these errors were encountered: