Bash Script is super slow

后端 未结 2 1578
深忆病人
深忆病人 2021-01-06 22:30

I\'m updating an old script to parse ARP data and get useful information out of it. We added a new router and while I can pull the ARP data out of the router it\'s in a new

相关标签:
2条回答
  • 2021-01-06 23:05

    When writing shell scripts, it’s almost always better to call a function directly rather than using a subshell to call the function. The usual convention that I’ve seen is to echo the return value of the function and capture that output using a subshell. For example:

    #!/bin/bash
    function get_path() {
        echo "/path/to/something"
    }
    mypath="$(get_path)"
    

    This works fine, but there is a significant speed overhead to using a subshell and there is a much faster alternative. Instead, you can just have a convention wherein a particular variable is always the return value of the function (I use retval). This has the added benefit of also allowing you to return arrays from your functions.

    If you don’t know what a subshell is, for the purposes of this blog post a subshell is another bash shell which is spawned whenever you use $() or `` and is used to execute the code you put inside.

    I did some simple testing to allow you to observe the overhead. For two functionally equivalent scripts:

    This one uses a subshell:

    #!/bin/bash
    function a() {
        echo hello
    }
    for (( i = 0; i < 10000; i++ )); do
        echo "$(a)"
    done
    

    This one uses a variable:

    #!/bin/bash
    function a() {
        retval="hello"
    }
    for (( i = 0; i < 10000; i++ )); do
        a
        echo "$retval"
    done
    

    The speed difference between these two is noticeable and significant.

    $ for i in variable subshell; do
    > echo -e "\n$i"; time ./$i > /dev/null
    > done
    
    variable
    
    real 0m0.367s
    user 0m0.346s
    sys 0m0.015s
    
    subshell
    
    real 0m11.937s
    user 0m3.121s
    sys 0m0.359s
    

    As you can see, when using variable, execution takes 0.367 seconds. subshell however takes a full 11.937 seconds!

    Source: http://rus.har.mn/blog/2010-07-05/subshells/

    0 讨论(0)
  • 2021-01-06 23:21
    1. While read loops are slow.
    2. Subshells in a loop are slow.
    3. >> (open(f, 'a')) calls in a loop are slow.

    You could speed this up and remain in pure bash, just by losing #2 and #3:

    #!/usr/bin/env bash
    
    while read -a line; do
        case "${#line[@]}" in
            6) printf '%s %s %s\n' "${line[1]}" "${line[3]}" "${line[5]}";;
            4) printf '%s %s %s\n' "${line[0]}" "${line[2]}" "${line[3]}";;
        esac
    done < zTempMonth >> zTempMonth.tmp
    

    But if there are more than a few lines, this will still be slower than pure awk. Consider an awk script as simple as this:

    BEGIN {
        print "Parsing zTempMonth"
    }   
    
    NF == 6 {
        print $2 " " $4 " " $6
    }   
    
    NF == 4 {
        print $1 " " $3 " " $4
    }   
    

    You could execute it like this:

    awk -f thatAwkScript zTempMonth >> zTempMonth.tmp
    

    to get the same append approach as your current script.

    0 讨论(0)
提交回复
热议问题