CSV parser fails when trying to import XLSX #108

funkybob · 2013-08-19T04:54:31Z

Using 0.9.11 I get the following trying to import a XLSX file...

Traceback (most recent call last):
File "...", line 81, in parse
self.source_data = tablib.import_set(source)
File ".../site-packages/tablib/core.py", line 1006, in import_set
format.import_set(data, stream)
File ".../site-packages/tablib/formats/_csv.py", line 41, in import_set
for i, row in enumerate(rows):
File ".../site-packages/tablib/packages/unicodecsv/__init__.py", line 54, in next
row = self.reader.next()
Error: line contains NULL byte

The text was updated successfully, but these errors were encountered:

funkybob · 2013-08-19T04:55:46Z

I just tried master, and it's now the yaml parser failing with:

File ".../site-packages/tablib/packages/yaml/reader.py", line 200, in update
exc.encoding, exc.reason)
ReaderError: 'utf8' codec can't decode byte #x8e: invalid start byte in "<string>", position 10

funkybob · 2013-08-19T05:43:54Z

So basically, it seems the parsers aren't failing cleanly when encoding is the reason they fail...

djrobstep · 2013-08-27T08:36:30Z

YAML parser also breaks when trying to import_set with a tsv.

yaml.scanner.ScannerError: while scanning for the next token
found character '\t' that cannot start any token

funkybob · 2013-08-27T11:20:19Z

Sadly, this and the pickling bug mean I've had to abandon my use of this library.

kennethreitz · 2014-01-08T19:50:27Z

This project is in a bit of a crisis state — it's really useful, and I use regularly. However, I wrote it several years ago and haven't touched it since. In order to get the project into a stable state I'm closing all issues and pull requests to get a "fresh slate"

Don't take this as aggressive — it's just necessary for the project to make any progress any time soon (it's pretty clear the project is effectively unmaintained at the moment). Great things to come! Please watch the GitHub logs and feel free to re-open this discussion soon. I just need to really it into a good state first.

✨ ❤️ ✨

iurisilvio · 2014-05-01T15:52:52Z

Reopening this issue because it is a real bug and should be fixed.

Should the import_set just ignore exceptions if any formatter accept the input?

iurisilvio · 2014-09-06T12:49:46Z

We have to ignore exceptions when we don't know the format.

Autodetection was added for the odf format.

claudep · 2019-10-04T21:41:51Z

I cannot say I solve all issues, but tablib should be a little more robust now wrt autodetection. Feel free to open new tickets if you can reproduce crashes on master.

kennethreitz closed this as completed Jan 8, 2014

iurisilvio added the crisis label Apr 23, 2014

iurisilvio reopened this May 1, 2014

iurisilvio removed the crisis label May 1, 2014

iurisilvio added the bug label Sep 6, 2014

claudep added a commit to claudep/tablib that referenced this issue Oct 4, 2019

Refs jazzband#108 - Test and improve format autodetection

1a5daf6

Autodetection was added for the odf format.

claudep added a commit to claudep/tablib that referenced this issue Oct 4, 2019

Refs jazzband#108 - Test and improve format autodetection

55c41f3

Autodetection was added for the odf format.

claudep added a commit that referenced this issue Oct 4, 2019

Refs #108 - Test and improve format autodetection

ca8dbcf

Autodetection was added for the odf format.

claudep closed this as completed Oct 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSV parser fails when trying to import XLSX #108

CSV parser fails when trying to import XLSX #108

funkybob commented Aug 19, 2013

funkybob commented Aug 19, 2013

funkybob commented Aug 19, 2013

djrobstep commented Aug 27, 2013

funkybob commented Aug 27, 2013

kennethreitz commented Jan 8, 2014

iurisilvio commented May 1, 2014

iurisilvio commented Sep 6, 2014

claudep commented Oct 4, 2019

CSV parser fails when trying to import XLSX #108

CSV parser fails when trying to import XLSX #108

Comments

funkybob commented Aug 19, 2013

funkybob commented Aug 19, 2013

funkybob commented Aug 19, 2013

djrobstep commented Aug 27, 2013

funkybob commented Aug 27, 2013

kennethreitz commented Jan 8, 2014

iurisilvio commented May 1, 2014

iurisilvio commented Sep 6, 2014

claudep commented Oct 4, 2019