I don't know how useful general "multi-character" delimiter support is, but one specific type of it that seems useful is "whitespace delimited", meaning any whitespace (possibly multiple / different whitespace characters).
In pandas you can achieve this either by passing `delimiter="\s+"` or by specifying `delim_whitespace=True`. The two are equivalent: pandas special-cases `delimiter="\s+"`, whereas any other multi-character delimiter is interpreted as an actual regex and triggers the slower Python engine instead of the default C engine.
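For illustration, a minimal sketch of the pandas behaviour described above, assuming pandas is installed. The sample text and variable names are made up for this example.

```python
import io

import pandas as pd

# Hypothetical sample: fields separated by a mix of spaces and tabs.
text = "a   b\tc\n1   2\t3\n4 \t 5   6\n"

# delimiter=r"\s+" treats any run of whitespace as a single separator.
df = pd.read_csv(io.StringIO(text), delimiter=r"\s+")

print(df)
```

A raw string (`r"\s+"`) is used so that Python does not try to interpret the backslash itself.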
Joris Van den Bossche / @jorisvandenbossche:
I don't really know about general "multi-character" support, but the specific case of a "whitespace delimiter" certainly is useful. For example, files that use multiple spaces to align columns in plain text are not uncommon, I think.
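As a rough sketch of what "whitespace delimited" means for such a column-aligned file, here is a plain-Python version using only the standard library. The sample data and the helper name `split_whitespace` are hypothetical and are not part of any Arrow or pandas API.

```python
import re

# Hypothetical column-aligned text: runs of spaces pad each column.
aligned = (
    "name      count   ratio\n"
    "alpha        12    0.50\n"
    "beta          3    0.25\n"
)


def split_whitespace(text):
    """Split every non-empty line on runs of whitespace."""
    return [
        re.split(r"\s+", line.strip())
        for line in text.splitlines()
        if line.strip()
    ]


rows = split_whitespace(aligned)
for row in rows:
    print(row)
```

Each run of spaces or tabs counts as one separator, so the padding used for alignment does not produce empty fields.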
cc @pitrou @nealrichardson
Reporter: Joris Van den Bossche / @jorisvandenbossche
Note: This issue was originally created as ARROW-10432. Please see the migration documentation for further details.