Fresh install on Ubuntu, using 'sudo apt-get install libtika-java' and 'sudo pip3 install tika'.
Parsing any PDF file with 'parsed = parser.from_file("a.pdf")' fails with:
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.6/dist-packages/tika/parser.py", line 36, in from_file
    output = parse1(service, filename, serverEndpoint, headers=headers, config_path=config_path, requestOptions=requestOptions)
  File "/usr/local/lib/python3.6/dist-packages/tika/tika.py", line 321, in parse1
    headers.update({'Accept': responseMimeType, 'Content-Disposition': make_content_disposition_header(path.encode('utf-8') if type(path) is unicode_string else path)})
  File "/usr/local/lib/python3.6/dist-packages/tika/tika.py", line 126, in make_content_disposition_header
    return build_header(os.path.basename(fn)).decode('ascii')
  File "/usr/local/lib/python3.6/dist-packages/rfc6266.py", line 430, in build_header
    if is_token(filename):
  File "/usr/local/lib/python3.6/dist-packages/rfc6266.py", line 370, in is_token
    return all(is_token_char(ch) for ch in candidate)
  File "/usr/local/lib/python3.6/dist-packages/rfc6266.py", line 370, in <genexpr>
    return all(is_token_char(ch) for ch in candidate)
  File "/usr/local/lib/python3.6/dist-packages/rfc6266.py", line 357, in is_token_char
    asciicode = ord(ch)
TypeError: ord() expected string of length 1, but int found
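A likely reading of the traceback, offered as a sketch rather than a confirmed diagnosis: the path appears to be encoded to bytes before it reaches the header builder, and on Python 3 iterating over bytes yields ints, which ord() rejects. The snippet below reproduces that behaviour without tika or rfc6266. The helpers is_token_char_sketch and is_token_sketch are hypothetical stand-ins, not the real library functions, and the character range they check is an assumption made only for illustration.

```python
def is_token_char_sketch(ch):
    # Hypothetical stand-in for the check seen in the traceback.
    # It assumes ch is a 1-character str; the range is illustrative only.
    return 31 < ord(ch) < 127


def is_token_sketch(candidate):
    # Same shape as the line shown in the traceback:
    # every element of the candidate is passed to ord().
    return all(is_token_char_sketch(ch) for ch in candidate)


name = "a.pdf"

# Iterating a str yields 1-character strings, so ord() works.
print(is_token_sketch(name))

# Iterating bytes on Python 3 yields ints, so ord() raises TypeError.
encoded = name.encode("utf-8")
try:
    is_token_sketch(encoded)
except TypeError as exc:
    print("TypeError:", exc)
```

If this reading is right, the failure does not depend on the PDF contents, which would match "any PDF file" failing in the same way.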