What killed my process and why?

后端 未结 14 976
误落风尘
误落风尘 2020-11-22 09:34

My application runs as a background process on Linux. It is currently started at the command line in a Terminal window.

Recently a user was executing the application

相关标签:
14条回答
  • 2020-11-22 09:57

    I encountered this problem lately. Finally, I found my processes were killed just after Opensuse zypper update was called automatically. To disable zypper update solved my problem.

    0 讨论(0)
  • 2020-11-22 09:59

    In an lsf environment (interactive or otherwise) if the application exceeds memory utilization beyond some preset threshold by the admins on the queue or the resource request in submit to the queue the processes will be killed so other users don't fall victim to a potential run away. It doesn't always send an email when it does so, depending on how its set up.

    One solution in this case is to find a queue with larger resources or define larger resource requirements in the submission.

    You may also want to review man ulimit

    Although I don't remember ulimit resulting in Killed its been a while since I needed that.

    0 讨论(0)
  • 2020-11-22 09:59

    Solved this issue by increasing swap size:

    https://askubuntu.com/questions/1075505/how-do-i-increase-swapfile-in-ubuntu-18-04

    0 讨论(0)
  • 2020-11-22 10:03

    Let me first explain when and why OOMKiller get invoked?

    Say you have 512 RAM + 1GB Swap memory. So in theory, your CPU has access to total of 1.5GB of virtual memory.

    Now, for some time everything is running fine within 1.5GB of total memory. But all of sudden (or gradually) your system has started consuming more and more memory and it reached at a point around 95% of total memory used.

    Now say any process has requested large chunck of memory from the kernel. Kernel check for the available memory and find that there is no way it can allocate your process more memory. So it will try to free some memory calling/invoking OOMKiller (http://linux-mm.org/OOM).

    OOMKiller has its own algorithm to score the rank for every process. Typically which process uses more memory becomes the victim to be killed.

    Where can I find logs of OOMKiller?

    Typically in /var/log directory. Either /var/log/kern.log or /var/log/dmesg

    Hope this will help you.

    Some typical solutions:

    1. Increase memory (not swap)
    2. Find the memory leaks in your program and fix them
    3. Restrict memory any process can consume (for example JVM memory can be restricted using JAVA_OPTS)
    4. See the logs and google :)
    0 讨论(0)
  • 2020-11-22 10:04

    The PAM module to limit resources caused exactly the results you described: My process died mysteriously with the text Killed on the console window. No log output, neither in syslog nor in kern.log. The top program helped me to discover that exactly after one minute of CPU usage my process gets killed.

    0 讨论(0)
  • 2020-11-22 10:05

    A tool like systemtap (or a tracer) can monitor kernel signal-transmission logic and report. e.g., https://sourceware.org/systemtap/examples/process/sigmon.stp

    # stap .../sigmon.stp -x 31994 SIGKILL
       SPID     SNAME            RPID  RNAME            SIGNUM SIGNAME
       5609     bash             31994 find             9      SIGKILL
    

    The filtering if block in that script can be adjusted to taste, or eliminated to trace systemwide signal traffic. Causes can be further isolated by collecting backtraces (add a print_backtrace() and/or print_ubacktrace() to the probe, for kernel- and userspace- respectively).

    0 讨论(0)
提交回复
热议问题