-
Notifications
You must be signed in to change notification settings - Fork 129
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parallel make randomly hangs #1148
Comments
I don't know if this helps narrow down things, but I just switched from |
Correction: these hangs/crashes don’t occur with the future backend, just clustermq. I do note that the future backend seems far slower; in my system monitor it seems to only have a 3-4 processes active at a time despite requesting 12 jobs (on a 6-core hyperthreading cpu), while with clustermq I get precisely the number of requested jobs active. |
Would it be possible to post a |
Looks like this is due to |
Thanks for looking into this! Would you still like to see the worker logs? |
I wouldn't work too hard on it. The logs might help later on, so if it won't take too much work on your end, then sure. Otherwise, I think we can wait for developments in |
For posterity, this is now fixed by setting a high timeout via |
Prework
drake
's code of conduct.remotes::install_github("ropensci/drake")
) and mention the SHA-1 hash of the Git commit you install.Description
Apologies for the absent reprex, but I'm having trouble creating one. I have a plan with lots of dynamic targets that runs fine in serial but whenever I try to run in parallel I encounter hangs at seemingly random stages of the make build. This occurs both in RStudio and also running R from the unix command line, and occurs
with both parallel backend optionswith only theclustermq
parallel backend. When I keep an eye on my system resource usage during the build, right before the hang the memory usage jumps up dramatically to 100%. When I kill and rundrake::make
again, the target that was computing at the moment of the hang will re-start and memory usually then just stays at regular levels (50%-ish) until I encounter a hang in a random later target. If you have any suggestions on how I might build a reprex for this one or further investigate what's going awry, I'm happy to provide more info.Session info
Created on 2020-01-24 by the reprex package (v0.3.0)
The text was updated successfully, but these errors were encountered: