-
-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Linux Segfaults #4504
Comments
What OS are you on? Can you try to compile from source on the exact OS you're trying to run it on? If you're on Fedora, this is probably a duplicate of #4461 |
I'm on CentOS which is related to fedora. I will coordinate with my cluster
people.
…On Tue, Mar 27, 2018 at 11:34 AM Mauro Bieg ***@***.***> wrote:
What OS are you on? Can you try to compile from source
<https://github.com/jgm/pandoc/blob/master/INSTALL.md#compiling-from-source>
on the exact OS you're trying to run it on?
If you're on Fedora, this is probably a duplicate of #4461
<#4461>
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#4504 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/ADOs6AYrY3D-c2Rq03Q5JLLKgRt6XAN-ks5tilvsgaJpZM4S9DY4>
.
|
I agree, it would be helpful to know if the problem persists with pandoc compiled on the target system. |
Could you please suggest some simple commands with the files to use? It segfaults even if I run On a related note, I'm getting an "error 139" on the working nodes for some files. The command run is as follows:
I couldn't find error 139 anywhere in the source, but I have read that it might be related to segfaults. |
Brian Fulton-Howard <[email protected]> writes:
Could you please suggest some simple commands with the files to use? It segfaults even if I run `pandoc` with no arguments, and it segfaults from rmarkdown.
I was thinking of something like
echo "Hello" | pandoc
On a related note, I'm getting an "error 139" on the working nodes for some files. The command run is as follows:
```
/sc/orga/projects/LOAD/Brian/anaconda3/bin/pandoc +RTS -K512m -RTS Post_imputation.utf8.md --to html --from markdown+autolink_bare_uris+ascii_identifiers+tex_math_single_backslash --output /sc/orga/projects/LOAD/Brian/projects/ADSP/data/pre_impute_merge/impute_stats/stats/CHARGE_CHS_impStats.html --smart --email-obfuscation none --self-contained --standalone --section-divs --template /hpc/packages/minerva-common/rpackages/3.4.3/site-library/rmarkdown/rmd/h/default.html --no-highlight --variable highlightjs=1 --variable 'theme:bootstrap' --include-in-header /tmp/101010540.tmpdir/RtmpomNOqg/rmarkdown-str799f60520417.html --mathjax --variable 'mathjax-url:https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML'
```
I couldn't find error 139 anywhere in the source, but I have read that it might be related to segfaults.
We don't use exit code 139 for anything.
You might try simplifying your command line piece by piece to see if you
can isolate something that's correlated with the problem. I assume
these nodes have enough memory to use 512m of stack space (that's what
+RTS -K512m -RTS calls for)? One thing to try is increasing or
decreasing this.
|
So I (with the help of a dedicated sysadmin) have confirmed that when pandoc segfaults immediately, it does so regardless of the simplicity of the command. I have also confirmed that error 139 is added by GHC and means there is a segfault. Error 139 occurs with some files on the Intel processors that don't immediately segfault. It does not occur on our ancient, slow AMD processors. Our AMD servers have 256 GB of memory, and the Intel servers have 64 GB. We installing Haskel Platform 8.2.2 and compiling the latest pandoc from source, as you requested. We will let you know the results when we manage to compile. |
Brian Fulton-Howard <[email protected]> writes:
So I (with the help of a dedicated sysadmin) have confirmed that when pandoc segfaults immediately, it does so regardless of the simplicity of the command.
I have also confirmed that error 139 is added by GHC and means there is a segfault. Error 139 occurs with some files on the Intel processors that don't immediately segfault. It does not occur on our ancient, slow AMD processors.
Our AMD servers have 256 GB of memory, and the Intel servers have 64 GB.
We installing Haskel Platform 8.2.2 and compiling the latest pandoc from source, as you requested. We will let you know the results when we manage to compile.
Great. My prediction is that the natively compiled version will work.
If it still segfaults, then very likely this points to a bug in GHC.
|
After native compilation, it no longer segfaults. Instead, this:
|
Very strange!
You might try compiling with a different version of ghc,
such as the latest, ghc 8.4.1. (Linux binaries are
available.)
If that doesn't fix things, reporting as a GHC bug would
be appreciated, I'm sure.
Brian Fulton-Howard <[email protected]> writes:
… After native compilation, it no longer segfaults. Instead, this:
```
pandoc: internal error: Unable to commit 1048576 bytes of memory
(GHC version 8.2.2 for x86_64_unknown_linux)
Please report this as a GHC bug: http://www.haskell.org/ghc/reportabug
```
--
You are receiving this because you commented.
Reply to this email directly or view it on GitHub:
#4504 (comment)
|
Try run it with |
Relevant ghc ticket: |
Are there any known work arounds to prevent the issue? I'm running into it on several VMs running RHEL 6 with various memory footprints (24GB, 48GB, 52GB, 64GB, etc.) with 50% or more free memory. I can't figure out why it |
Having the same issue. Even when i just run
|
@mahermassoud please give more information: I note that the ghc issue linked above is still open. |
@jgm I believe I'm on redhat because
I'm in a totally new conda environment where i installed r and r studio using No idea which version it is. Let me know if you need more info I got a similar issue installing with |
|
Closing ; the upstream ghc issues has been fixed for a long time. |
We are getting segfaults on some nodes of our cluster, but not others when running several pandoc versions:
Our failing nodes all have one of the following processors:
Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz,
Intel(R) Xeon(R) CPU E5-2643 v2 @ 3.50GHz
However, it consistently segfaults on some of these nodes but not on others.
It does not segfault on our AMD Opteron(TM) Processor 6276s.
Kernel versions are as follows:
2.6.32-358.23.2.el6.x86_64,
2.6.32-696.18.7.el6.x86_64
2.6.32-358.23.2.el6.x86_64
2.6.32-358.6.2.el6.x86_64
All nodes are on CentOS 6.2
This occurs with both the system pandoc (1.19.2.1), a modulecmd version (2.0), and the statically linked, pre-compiled version 2.1.3.
It also segfaults in bash:
[fultob01@interactive5 ~]$ pandoc --version Segmentation fault
With my statically linked pandoc, I get a really nice strace. It looks like pandoc is trying to write to 0x4200000000, which is out of bounds, and bode allows the write but shouldn't, so pandoc segfaults when it attempts to read. I have no idea what the solution is for this, but for now, I'll use mothra or manda. Do you have any idea why bode is allowing pandoc to write to that address?
Here's the
strace
trace:On one of our working nodes, it looks like this:
The text was updated successfully, but these errors were encountered: