How to get the max memory usage of a program using psutil in Python

后端 未结 2 713
无人及你
无人及你 2020-12-03 20:04

I am using the following code to get the max memory usage of the program.

    import os, subprocess , psutil
    def mem(cmd):
        try:
            with          


        
相关标签:
2条回答
  • 2020-12-03 20:47

    <defunct> means that the subprocess is a zombie process (it is dead but its status has not been read yet by the parent (p.poll() or p.wait())). It seems both psutil and ps shows RSS to be zero for such processes.

    The result depends on whether the subprocess will exit sooner than p.memory_info() is called. It is a race. If you add a delay at the exit in the C++ program then p.memory_info() may be called before the subprocess exits and you should get non-zero results.

    The problem is that I can get arbitrary programs to evaluate . The language is also not fixed. Isn't there an elegant solution to this problem?

    You might need OS support to save info about the memory usage by a subprocess even after it exits. Or you could run the program using a memory profiler such as valgrind and read its results. To collect results:

    $ valgrind --tool=massif cmd arg1 arg2 
    

    To see the results, you could use ms_print:

    $ ms_print massif.out.* | less
    

    Or GUI Massif-Visualizer

    @mdadm suggested a simpler solution: time command:

    from subprocess import Popen, PIPE
    
    p = Popen(['time', '-f', '%M'] + args, stderr=PIPE)
    ru_maxrss = int(p.communicate()[1])
    print("Maximum rss %d KB" % ru_maxrss)
    

    GNU time uses wait3() to populate resource usage info if it is available. It can be called in Python:

    import os
    from subprocess import Popen
    
    p = Popen(args)
    ru = os.wait4(p.pid, 0)[2]
    print("Maximum rss %d KB" % ru.ru_maxrss)
    

    I've compared the maximum value returned by psutil.Process.memory_info (rss) with ru_maxrss value returned by os.wait4 and with the maximum total memory reported by valgrind --tool=massif: they are similar.

    See also:

    • A way to determine a process's “real” memory usage, i.e. private dirty RSS?
    • How is memory usage reported in Linux?
    • Find maximum memory consumed by a process specified at the command-line.
    0 讨论(0)
  • 2020-12-03 20:50

    The problem here is that psutils takes a quick snapshot from the /proc filesystem, as you can see in the source.

    When you run your hello world example, in some cases it finishes before python gets a chance to read the values from /proc.

    Once the process is finished, it effectively no longer consumes any ram. You can confirm this with an strace.

    open("/proc/13420/statm", O_RDONLY)     = 3
    read(3, "0 0 0 0 0 0 0\n", 1024)        = 14
    

    If you modify your example to use something like sleep, you'll notice that psutils consistently gives memory usage back.

    #include <iostream>
    #include <unistd.h>
    
    int main()
    {
      std::cout << "Hello World.. sleeping!";
      sleep(3);
    }
    

    Output of your python script...

    a.out
    meminfo(rss=286720, vms=12931072)
    

    One simple way to accomplish what you are trying to do, is by using the /usr/bin/time command, which on most platforms will give you the average total memory usage of the process you launch OR use valgrind as J.F Sebastian suggests... whom posted as I was researching and testing my answer ;)

    Hello World.. sleeping!0.00user 0.00system 0:03.00elapsed 0%CPU     
    (0avgtext+0avgdata 1144maxresident)k
    0inputs+0outputs (0major+348minor)pagefaults 0swaps
    
    0 讨论(0)
提交回复
热议问题