[R] Progress bar for read_feather for R and a verbose version #43404
Comments
I think this is a great idea @ajinkya-k though I'm not sure how feasible this is as this has been discussed in relation to another piece of functionality and we concluded it would be tricky as it'd require non-trivial updates to the Arrow C++ library. Out of interest, once you've loaded the file, are you performing further dplyr manipulations? It might be that you get better performance calling |
Thanks for the update @thisisnic. I do a join and a few filters that drop less than 1% of the rows and then collect, but it's still a huge dataset after that, which I plug into a Bayesian model. The Bayesian model does work; it's just that due to DUA constraints I have to keep the file on a network drive and pull from there. That makes it hard to tell whether the file is being loaded at all, so a progress bar would help me figure out whether the read is progressing or whether network throttling has left the process hung.
Ah, that makes sense, doesn't sound like there's much else to suggest in terms of temporary workarounds then! |
@thisisnic I ran the code a few more times and it turns out that the |
I believe I've seen issues with this kind of thing on Windows when reading across a network drive, though I'm not sure. It could be worth comparing with a local file to test.
Yeah, unfortunately I can't make a copy of the data on my local machine due to DUA constraints. I might try an open-source dataset to test this, though. Any suggestions for a dataset?
Describe the enhancement requested
I would like to request that a progress bar be shown when using the read_feather function in R, especially for large files, so that the user can see whether the file is actually being read and progress is being made. This would be similar to data.table::fread, which shows a simple progress bar enabled by its showProgress argument. I have a use case in which I am using read_feather to read a large file into R from a network drive, and during some runs there is no indication of whether R is making any progress on loading the file. In other runs it loads in ~300 seconds. fread also has a verbose option that prints much more output, which would also be well worth implementing, but a progress bar at minimum would be great!
Component(s)
R
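To make the request concrete, here is a rough sketch of the kind of behaviour being asked for. It is not something proposed in this thread and not part of arrow: the helper name read_feather_with_progress is hypothetical. It assumes the file is Feather V2 (the Arrow IPC file format) and was written with more than one record batch, and it assumes that get_batch() on RecordBatchFileReader takes a 0-based index. Progress is only as fine-grained as the batches in the file, so a file stored as a single batch would still show nothing until it finishes.

```r
library(arrow)

# Hypothetical helper (not an arrow function): read a Feather V2 / Arrow IPC
# file one record batch at a time and update a text progress bar per batch.
read_feather_with_progress <- function(path) {
  reader <- RecordBatchFileReader$create(path)
  n <- reader$num_record_batches
  stopifnot(n > 0)

  pb <- utils::txtProgressBar(min = 0, max = n, style = 3)
  on.exit(close(pb), add = TRUE)

  batches <- vector("list", n)
  for (i in seq_len(n)) {
    # Assumption: get_batch() uses a 0-based index, hence i - 1.
    batches[[i]] <- reader$get_batch(i - 1)
    utils::setTxtProgressBar(pb, i)
  }

  # Combine the record batches into one Table, then convert to a data frame.
  as.data.frame(do.call(Table$create, batches))
}
```

If the bar never moves past 0%, that at least suggests the first batch has not arrived from the network drive, which is the signal the request is after.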