Fastest way to solve chain-calculations

后端 未结 5 826
长发绾君心
长发绾君心 2021-02-04 07:14

I have a input like

string input = \"14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34\"; 
// up to 10MB string size

int result = Calc(input); // 11


        
相关标签:
5条回答
  • 2021-02-04 07:19

    Edit edit: updated with latest versions by The General and Mirai Mann:

    If you want to know which horse is fastest: race the horses. Here are BenchmarkDotNet results comparing various answers from this question (I have not merged their code into my full example, because that feels wrong - only the numbers are presented) with repeatable but large random input, via:

    static MyTests()
    {
        Random rand = new Random(12345);
        StringBuilder input = new StringBuilder();
        string operators = "+-*/";
        var lastOperator = '+';
        for (int i = 0; i < 1000000; i++)
        {
            var @operator = operators[rand.Next(0, 4)];
            input.Append(rand.Next(lastOperator == '/' ? 1 : 0, 100) + " " + @operator + " ");
            lastOperator = @operator;
        }
        input.Append(rand.Next(0, 100));
        expression = input.ToString();
    }
    private static readonly string expression;
    

    with sanity checks (to check they all do the right thing):

    Original: -1426
    NoSubStrings: -1426
    NoSubStringsUnsafe: -1426
    TheGeneral4: -1426
    MiraiMann1: -1426
    

    we get timings (note: Original is OP's version in the question; NoSubStrings[Unsafe] is my versions from below, and two other versions from other answers by user-name):

    (lower "Mean" is better)

    (note; there is a newer version of Mirai Mann's implementation, but I no longer have things setup to run a new test; but: fair to assume it should be better!)

    Runtime: .NET Framework 4.7 (CLR 4.0.30319.42000), 32bit LegacyJIT-v4.7.2633.0

                 Method |      Mean |     Error |    StdDev |
    ------------------- |----------:|----------:|----------:|
               Original | 104.11 ms | 1.4920 ms | 1.3226 ms |
           NoSubStrings |  21.99 ms | 0.4335 ms | 0.7122 ms |
     NoSubStringsUnsafe |  20.53 ms | 0.4103 ms | 0.6967 ms |
            TheGeneral4 |  15.50 ms | 0.3020 ms | 0.5369 ms |
             MiraiMann1 |  15.54 ms | 0.3096 ms | 0.4133 ms |
    

    Runtime: .NET Framework 4.7 (CLR 4.0.30319.42000), 64bit RyuJIT-v4.7.2633.0

                 Method |      Mean |     Error |    StdDev |    Median |
    ------------------- |----------:|----------:|----------:|----------:|
               Original | 114.15 ms | 1.3142 ms | 1.0974 ms | 114.13 ms |
           NoSubStrings |  21.33 ms | 0.4161 ms | 0.6354 ms |  20.93 ms |
     NoSubStringsUnsafe |  19.24 ms | 0.3832 ms | 0.5245 ms |  19.43 ms |
            TheGeneral4 |  13.97 ms | 0.2795 ms | 0.2745 ms |  13.86 ms |
             MiraiMann1 |  15.60 ms | 0.3090 ms | 0.4125 ms |  15.53 ms |
    

    Runtime: .NET Core 2.1.0-preview1-26116-04 (CoreCLR 4.6.26116.03, CoreFX 4.6.26116.01), 64bit RyuJIT

                 Method |      Mean |     Error |    StdDev |    Median |
    ------------------- |----------:|----------:|----------:|----------:|
               Original | 101.51 ms | 1.7807 ms | 1.5786 ms | 101.44 ms |
           NoSubStrings |  21.36 ms | 0.4281 ms | 0.5414 ms |  21.07 ms |
     NoSubStringsUnsafe |  19.85 ms | 0.4172 ms | 0.6737 ms |  19.80 ms |
            TheGeneral4 |  14.06 ms | 0.2788 ms | 0.3723 ms |  13.82 ms |
             MiraiMann1 |  15.88 ms | 0.3153 ms | 0.5922 ms |  15.45 ms |
    

    Original answer from before I added BenchmarkDotNet:

    If I was trying this, I'd be tempted to have a look at the Span<T> work in 2.1 previews - the point being that a Span<T> can be sliced without allocating (and a string can be converted to a Span<char> without allocating); this would allow the string carving and parsing to be performed without any allocations. However, reducing allocations is not always quite the same thing as raw performance (although they are related), so to know if it was faster: you'd need to race your horses (i.e. compare them).

    If Span<T> isn't an option: you can still do the same thing by tracking an int offset manually and just *never using SubString)

    In either case (string or Span<char>): if your operation only needs to cope with a certain subset of representations of integers, I might be tempted to hand role a custom int.Parse equivalent that doesn't apply as many rules (cultures, alternative layouts, etc), and which works without needing a Substring - for example it could take a string and ref int offset, where the offset is updated to be where the parse stopped (either because it hit an operator or a space), and which worked.

    Something like:

    static class P
    {
        static void Main()
        {
            string input = "14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34";
    
            var val = Evaluate(input);
            System.Console.WriteLine(val);
        }
        static int Evaluate(string expression)
        {
            int offset = 0;
            SkipSpaces(expression, ref offset);
            int value = ReadInt32(expression, ref offset);
            while(ReadNext(expression, ref offset, out char @operator, out int operand))
            {
                switch(@operator)
                {
                    case '+': value = value + operand; break;
                    case '-': value = value - operand; break;
                    case '*': value = value * operand; break;
                    case '/': value = value / operand; break;
                }
            }
            return value;
        }
        static bool ReadNext(string value, ref int offset,
            out char @operator, out int operand)
        {
            SkipSpaces(value, ref offset);
    
            if(offset >= value.Length)
            {
                @operator = (char)0;
                operand = 0;
                return false;
            }
    
            @operator = value[offset++];
            SkipSpaces(value, ref offset);
    
            if (offset >= value.Length)
            {
                operand = 0;
                return false;
            }
            operand = ReadInt32(value, ref offset);
            return true;
        }
    
        static void SkipSpaces(string value, ref int offset)
        {
            while (offset < value.Length && value[offset] == ' ') offset++;
        }
        static int ReadInt32(string value, ref int offset)
        {
            bool isNeg = false;
            char c = value[offset++];
            int i = (c - '0');
            if(c == '-')
            {
                isNeg = true;
                i = 0;
                // todo: what to do here if `-` is not followed by [0-9]?
            }
    
            while (offset < value.Length && (c = value[offset++]) >= '0' && c <= '9')
                i = (i * 10) + (c - '0');
            return isNeg ? -i : i;
        }
    }
    

    Next, I might consider whether it is worthwhile switching to unsafe to remove the double length checks. So I'd implement it both ways, and test it with something like BenchmarkDotNet to see whether it is worth it.


    Edit: here is is with unsafe added and BenchmarkDotNet usage:

    using BenchmarkDotNet.Attributes;
    using BenchmarkDotNet.Running;
    using System;
    
    static class P
    {
        static void Main()
        {
            var summary = BenchmarkRunner.Run<MyTests>();
            System.Console.WriteLine(summary);
        }
    
    }
    public class MyTests
    {
        const string expression = "14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34";
        [Benchmark]
        public int Original() => EvalOriginal.Calc(expression);
        [Benchmark]
        public int NoSubStrings() => EvalNoSubStrings.Evaluate(expression);
        [Benchmark]
        public int NoSubStringsUnsafe() => EvalNoSubStringsUnsafe.Evaluate(expression);
    }
    static class EvalOriginal
    {
        public static int Calc(string sInput)
        {
            int iCurrent = sInput.IndexOf(' ');
            int iResult = int.Parse(sInput.Substring(0, iCurrent));
            int iNext = 0;
            while ((iNext = sInput.IndexOf(' ', iCurrent + 4)) != -1)
            {
                iResult = Operate(iResult, sInput[iCurrent + 1], int.Parse(sInput.Substring((iCurrent + 3), iNext - (iCurrent + 2))));
                iCurrent = iNext;
            }
            return Operate(iResult, sInput[iCurrent + 1], int.Parse(sInput.Substring((iCurrent + 3))));
        }
    
        public static int Operate(int iReturn, char cOperator, int iOperant)
        {
            switch (cOperator)
            {
                case '+':
                    return (iReturn + iOperant);
                case '-':
                    return (iReturn - iOperant);
                case '*':
                    return (iReturn * iOperant);
                case '/':
                    return (iReturn / iOperant);
                default:
                    throw new Exception("Error");
            }
        }
    }
    static class EvalNoSubStrings {
        public static int Evaluate(string expression)
        {
            int offset = 0;
            SkipSpaces(expression, ref offset);
            int value = ReadInt32(expression, ref offset);
            while (ReadNext(expression, ref offset, out char @operator, out int operand))
            {
                switch (@operator)
                {
                    case '+': value = value + operand; break;
                    case '-': value = value - operand; break;
                    case '*': value = value * operand; break;
                    case '/': value = value / operand; break;
                    default: throw new Exception("Error");
                }
            }
            return value;
        }
        static bool ReadNext(string value, ref int offset,
            out char @operator, out int operand)
        {
            SkipSpaces(value, ref offset);
    
            if (offset >= value.Length)
            {
                @operator = (char)0;
                operand = 0;
                return false;
            }
    
            @operator = value[offset++];
            SkipSpaces(value, ref offset);
    
            if (offset >= value.Length)
            {
                operand = 0;
                return false;
            }
            operand = ReadInt32(value, ref offset);
            return true;
        }
    
        static void SkipSpaces(string value, ref int offset)
        {
            while (offset < value.Length && value[offset] == ' ') offset++;
        }
        static int ReadInt32(string value, ref int offset)
        {
            bool isNeg = false;
            char c = value[offset++];
            int i = (c - '0');
            if (c == '-')
            {
                isNeg = true;
                i = 0;
            }
    
            while (offset < value.Length && (c = value[offset++]) >= '0' && c <= '9')
                i = (i * 10) + (c - '0');
            return isNeg ? -i : i;
        }
    }
    
    static unsafe class EvalNoSubStringsUnsafe
    {
        public static int Evaluate(string expression)
        {
    
            fixed (char* ptr = expression)
            {
                int len = expression.Length;
                var c = ptr;
                SkipSpaces(ref c, ref len);
                int value = ReadInt32(ref c, ref len);
                while (len > 0 && ReadNext(ref c, ref len, out char @operator, out int operand))
                {
                    switch (@operator)
                    {
                        case '+': value = value + operand; break;
                        case '-': value = value - operand; break;
                        case '*': value = value * operand; break;
                        case '/': value = value / operand; break;
                        default: throw new Exception("Error");
                    }
                }
                return value;
            }
        }
        static bool ReadNext(ref char* c, ref int len,
            out char @operator, out int operand)
        {
            SkipSpaces(ref c, ref len);
    
            if (len-- == 0)
            {
                @operator = (char)0;
                operand = 0;
                return false;
            }
            @operator = *c++;
            SkipSpaces(ref c, ref len);
    
            if (len == 0)
            {
                operand = 0;
                return false;
            }
            operand = ReadInt32(ref c, ref len);
            return true;
        }
    
        static void SkipSpaces(ref char* c, ref int len)
        {
            while (len != 0 && *c == ' ') { c++;len--; }
        }
        static int ReadInt32(ref char* c, ref int len)
        {
            bool isNeg = false;
            char ch = *c++;
            len--;
            int i = (ch - '0');
            if (ch == '-')
            {
                isNeg = true;
                i = 0;
            }
    
            while (len-- != 0 && (ch = *c++) >= '0' && ch <= '9')
                i = (i * 10) + (ch - '0');
            return isNeg ? -i : i;
        }
    }
    
    0 讨论(0)
  • 2021-02-04 07:29

    NOTE

    Per comments, this answer does not give a performant solution. I'll leave it here as there are points to be considered / which may be of interest to others finding this thread in future; but as people have said below, this is far from the most performant solution.


    Original Answer

    The .net framework already supplies a way to handle formulas given as strings:

    var formula = "14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34";
    var result = new DataTable().Compute(formula, null);
    Console.WriteLine(result); //returns 139.125490196078
    

    Initial feedback based on comments

    Per the comments thread I need to point out some things:

    Does this work the way I've described?

    No; this follows the normal rules of maths.

    I assume that your amended rules are to simplify writing code to handle them, rather than because you want to support a new branch of mathematics? If that's the case, I'd argue against that. People will expect things to behave in a certain way; so you'd have to ensure that anyone sending equations to your code was primed with the knowledge to expect the rules of this new-maths rather than being able to use their existing expectations.

    There isn't an option to change the rules here; so if your requirement is to change the rules of maths, this won't work for you.

    Is this the Fastest Solution

    No. However it should perform well given MS spend a lot of time optimising their code, and so will likely perform faster than any hand-rolled code to do the same (though admittedly this code does a lot more than just support the four main operators; so it's not doing exactly the same).

    Per MatthewWatson's specific comment (i.e. calling the DataTable constructor incurs a significant overhead) you'd want to create and then re-use one instance of this object. Depending on what your solution looks like there are various ways to do that; here's one:

    interface ICalculator //if we use an interface we can easily switch from datatable to some other calulator; useful for testing, or if we wanted to compare different calculators without much recoding
    {
        T Calculate<T>(string expression) where T: struct;
    }
    class DataTableCalculator: ICalculator 
    {
        readonly DataTable dataTable = new DataTable();
        public DataTableCalculator(){}
        public T Calculate<T>(string expression) where T: struct =>
            (T)dataTable.Compute(expression, null);
    }
    
    class Calculator: ICalculator
    {
        static ICalculator internalInstance;
        public Calculator(){}
        public void InitialiseCalculator (ICalculator calculator)
        {
            if (internalInstance != null)
            {
                throw new InvalidOperationException("Calculator has already been initialised");
            }
            internalInstance = calculator;
        }
        public T Calculate<T>(string expression) where T: struct =>
            internalInstance.Calculate<T>(expression);
    }
    
    //then we use it on our code
    void Main()
    {
        var calculator1 = new Calculator();
        calculator1.InitialiseCalculator(new DataTableCalculator());
        var equation = "14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34"; 
        Console.WriteLine(calculator1.Calculate<double>(equation)); //139.125490196078
        equation = "1 + 2 - 3 + 4";
        Console.WriteLine(calculator1.Calculate<int>(equation)); //4
        calculator1 = null;
        System.GC.Collect(); //in reality we'd pretty much never do this, but just to illustrate that our static variable continues fro the life of the app domain rather than the life of the instance
        var calculator2 = new Calculator();
        //calculator2.InitialiseCalculator(new DataTableCalculator()); //uncomment this and you'll get an error; i.e. the calulator should only be initialised once.
        equation = "1 + 2 - 3 + 4 / 5 * 6 - 7 / 8 + 9";
        Console.WriteLine(calculator2.Calculate<double>(equation)); //12.925
    }
    

    NB: The above solution uses a static variable; some people are against use of statics. For this scenario (i.e. where during the lifetime of the application you're only going to require one way of doing calculations) this is a legitimate use case. If you wanted to support switching the calculator at runtime a different approach would be required.


    Update after Testing & Comparing

    Having run some performance tests:

    • The DataTable.Compute method's biggest problem is that for equations the size of which you're dealing with it throws a StackOverflowException; (i.e. based on your equation generator's loop for (int i = 0; i < 1000000; i++).
    • For a single operation with a smaller equation (i < 1000), the compute method (including constructor and Convert.ToInt32 on the double result) takes almost 100 times longer.
    • for the single operation I also encountered overflow exceptions more often; i.e. because the result of the operations had pushed the value outside the bounds of supported data types...
    • Even if we move the constructor/initialise call outside of the timed area and remove the conversion to int (and run for thousands of iterations to get an average), your solution comes in 3.5 times faster than mine.

    Link to the docs: https://msdn.microsoft.com/en-us/library/system.data.datatable.compute%28v=vs.110%29.aspx?f=255&MSPPError=-2147217396

    0 讨论(0)
  • 2021-02-04 07:29

    Here is a Java fun fact. I implemented the same thing in Java and it runs about 20 times faster than Mirai Mann implementation in C#. On my machine 100M chars input string took about 353 milliseconds.

    Below is the code that creates and tests the result.

    Also, note that while it's a good Java/C# performance tester this is not an optimal solution. A better performance can be achieved by multithreading it. It's possible to calculate portions of the string and then combine the result.

    public class Test {
    
        public static void main(String...args){
            new Test().run();
        }
    
        private void run() {
            long startTime = System.currentTimeMillis();
            Random random = new Random(123);
            int result = 0;
            StringBuilder input = new StringBuilder();
            input.append(random.nextInt(99) + 1);
            while (input.length() < 100_000_000){
                int value = random.nextInt(100);
                switch (random.nextInt(4)){
                    case 0:
                        input.append("-");
                        result -= value;
                        break;
                    case 1: // +
                        input.append("+");
                        result += value;
                        break;
                    case 2:
                        input.append("*");
                        result *= value;
                        break;
                    case 3:
                        input.append("/");
                        while (value == 0){
                            value = random.nextInt(100);
                        }
                        result /= value;
                        break;
                }
                input.append(value);
            }
            String inputData = input.toString();
            System.out.println("Test created in " + (System.currentTimeMillis() - startTime));
    
            startTime = System.currentTimeMillis();
            int testResult = test(inputData);
            System.out.println("Completed in " + (System.currentTimeMillis() - startTime));
    
            if(result != testResult){
                throw new Error("Oops");
            }
        }
    
        private int test(String inputData) {
            char[] input;
            try {
                Field val = String.class.getDeclaredField("value");
                val.setAccessible(true);
                input = (char[]) val.get(inputData);
            } catch (NoSuchFieldException | IllegalAccessException e) {
                throw new Error(e);
            }
            int result = 0;
            int startingI = 1;
            {
                char c = input[0];
                if (c >= '0' && c <= '9') {
                    result += c - '0';
                    c = input[1];
                    if (c >= '0' && c <= '9') {
                        result += (c - '0') * 10;
                        startingI++;
                    }
                }
            }
    
            for (int i = startingI, length = input.length, value=0; i < length; i++) {
                char operation = input[i];
                i++;
                char c = input[i];
                if(c >= '0' && c <= '9'){
                    value += c - '0';
                    c = input[i + 1];
                    if(c >= '0' && c <= '9'){
                        value = value * 10 + (c - '0');
                        i++;
                    }
                }
                switch (operation){
                    case '-':
                        result -= value;
                        break;
                    case '+':
                        result += value;
                        break;
                    case '*':
                        result *= value;
                        break;
                    case '/':
                        result /= value;
                        break;
                }
                value = 0;
            }
    
            return result;
        }
    }
    

    When you read the code then you can see that I used a small hack when converting the string to a char array. I mutated the string in order to avoid additional memory allocations for the char array.

    0 讨论(0)
  • 2021-02-04 07:31

    Update

    My original answer was just a bit of fun late at night trying to put this in unsafe and i failed miserably (actually didn't work at all and was slower). However i decided to give this another shot

    The premise was to make everything inline, to remove as much IL as i could, keep everything in int or char*, and make my code pretty. I further optimized this by removing the switch, Ifs will be ore efficient in this situation, also we can order them in the most logical way. And lastly, if we remove the amount of checks for things we do and assume the input is correct we can remove even more overhead by just assuming things like; if the char is > '0' it must be a number. if its a space we can do some calculations, else it must be an operator.

    This is my last attempt with 10,000,000 calculations run 100 times to get an average, each test does a GC.Collect(); and GC.WaitForPendingFinalizers(); so we are'nt fragmenting the memory

    Results

    Test                          : ms    : Cycles (rough) : Increase
    -------------------------------------------------------------------
    OriginalCalc                  : 1,295 : 4,407,795,584  :
    MarcEvalNoSubStrings          :   241 :   820,660,220  : 437.34%, * 5.32
    MarcEvalNoSubStringsUnsafe    :   206 :   701,980,373  : 528.64%, * 6.28
    MiraiMannCalc1                :   225 :   765,678,062  : 475.55%, * 5.75
    MiraiMannCalc2                :   183 :   623,384,924  : 607.65%, * 7.07
    MyCalc4                       :   156 :   534,190,325  : 730.12%, * 8.30
    MyCalc5                       :   146 :   496,185,459  : 786.98%, * 8.86
    MyCalc6                       :   134 :   455,610,410  : 866.41%, * 9.66
    

    Fastest Code so far

    unsafe int Calc6(ref string expression)
    {
       int res = 0, val = 0, op = 0;
       var isOp = false;
    
       // pin the array
       fixed (char* p = expression)
       {
          // Lets not evaluate this 100 million times
          var max = p + expression.Length;
    
          // lets go straight to the source and just increment the pointer
          for (var i = p; i < max; i++)
          {
             // numbers are the most common thing so lets do a loose
             // basic check for them and push them in to our val
             if (*i >= '0') { val = val * 10 + *i - 48; continue; }
    
             // The second most common thing are spaces
             if (*i == ' ')
             {
                // not every space we need to calculate
                if (!(isOp = !isOp)) continue;
    
                // In this case 4 ifs are more efficient then a switch
                // do the calculation, reset out val and jump out
                if (op == '+') { res += val; val = 0; continue; }
                if (op == '-') { res -= val; val = 0; continue; }
                if (op == '*') { res *= val; val = 0; continue; }
                if (op == '/') { res /= val; val = 0; continue; }
    
                // this is just for the first op
                res = val; val = 0; continue;                
             }
             // anything else is considered an operator
             op = *i;
          }
    
          if (op == '+') return res + val;
          if (op == '-') return res - val;
          if (op == '*') return res * val;
          if (op == '/') return res / val;
    
          throw new IndexOutOfRangeException();
       }
    }
    

    Previous

    unsafe int Calc4(ref string expression)
    {
       int res = 0, val = 0, op = 0;
       var isOp = false;
    
       fixed (char* p = expression)
       {
          var max = p + expression.Length;
          for (var i = p; i < max; i++)
             switch (*i)
             {               
                case ' ':
                   isOp = !isOp;
                   if (!isOp) continue;    
                   switch (op)
                   {
                      case '+': res += val; val = 0; continue;
                      case '-': res -= val; val = 0; continue;
                      case '*': res *= val; val = 0; continue;
                      case '/': res /= val; val = 0; continue;
                      default: res = val; val = 0;  continue;
                   }
                case '+': case '-': case '*': case '/': op = *i; continue;
                default: val = val * 10 + *i - 48; continue;
             }
    
          switch (op)
          {
             case '+': return res + val;
             case '-': return res - val;
             case '*': return res * val;
             case '/': return res / val;
             default : return -1;
          }
       }
    }
    

    How i measured the Thread cycles

    static class NativeMethods {
        public static ulong GetThreadCycles() {
            ulong cycles;
            if (!QueryThreadCycleTime(PseudoHandle, out cycles))
                throw new System.ComponentModel.Win32Exception();
            return cycles;
        }
        [DllImport("kernel32.dll", SetLastError = true)]
        private static extern bool QueryThreadCycleTime(IntPtr hThread, out ulong cycles);
        private static readonly IntPtr PseudoHandle = (IntPtr)(-2);
    
    }
    

    Original Post

    I thought id try to be smart and use fixed :/ and max this out with millions of calculations

    public static unsafe int Calc2(string sInput)
    {
       var buf = "";
       var start = sInput.IndexOf(' ');
       var value1 = int.Parse(sInput.Substring(0, start));
       string op = null;
       var iResult = 0;
       var isOp = false;
       fixed (char* p = sInput)
       {
          for (var i = start + 1; i < sInput.Length; i++)
          {
             var cur = *(p + i);
             if (cur == ' ')
             {
                if (!isOp)
                {
                   op = buf;
                   isOp = true;
                }
                else
                {
                   var value2 = int.Parse(buf);
                   switch (op[0])
                   {
                      case '+': iResult += value1 + value2; break;
                      case '-': iResult += value1 - value2; break;
                      case '*': iResult += value1 * value2; break;
                      case '/': iResult += value1 / value2; break;
                   }
    
                   value1 = value2;
                   isOp = false;
                }
    
                buf = "";
             }
             else
             {
                buf += cur;
             }
          }
       }
    
       return iResult;
    }
    
    private static void Main(string[] args)
    {
       var input = "14 + 2 * 32 / 60 + 43 - 7 + 3 - 1 + 0 * 7 + 87 - 32 / 34";
       var sb = new StringBuilder();
       sb.Append(input);
       for (var i = 0; i < 10000000; i++)
          sb.Append(" + " + input);
    
       var sw = new Stopwatch();
       sw.Start();
    
       Calc2(sb.ToString());
    
       sw.Stop();
    
       Console.WriteLine($"sw : {sw.Elapsed:c}");
    }
    

    Results were 2 seconds slower than the original!

    0 讨论(0)
  • 2021-02-04 07:33

    The following solution is a finite automaton. Calc(input) = O(n). For better performance, this solution does not use IndexOf, Substring, Parse, string concatenation, or repeated reading of value (fetching input[i] more than once)... just a character processor.

        static int Calculate1(string input)
        {
            int acc = 0;
            char last = ' ', operation = '+';
    
            for (int i = 0; i < input.Length; i++)
            {
                var current = input[i];
                switch (current)
                {
                    case ' ':
                        if (last != ' ')
                        {
                            switch (operation)
                            {
                                case '+': acc += last - '0'; break;
                                case '-': acc -= last - '0'; break;
                                case '*': acc *= last - '0'; break;
                                case '/': acc /= last - '0'; break;
                            }
    
                            last = ' ';
                        }
    
                        break;
    
                    case '0': case '1': case '2': case '3': case '4':
                    case '5': case '6': case '7': case '8': case '9':
                        if (last == ' ') last = current;
                        else
                        {
                            var num = (last - '0') * 10 + (current - '0');
                            switch (operation)
                            {
                                case '+': acc += num; break;
                                case '-': acc -= num; break;
                                case '*': acc *= num; break;
                                case '/': acc /= num; break;
                            }
                            last = ' ';
                        }
                        break;
    
                    case '+': case '-': case '*': case '/':
                        operation = current;
                        break;
                }
            }
    
            if (last != ' ')
                switch (operation)
                {
                    case '+': acc += last - '0'; break;
                    case '-': acc -= last - '0'; break;
                    case '*': acc *= last - '0'; break;
                    case '/': acc /= last - '0'; break;
                }
    
            return acc;
        }
    

    And another implementation. It reads 25% less from the input. I expect that it has 25% better performance.

        static int Calculate2(string input)
        {
            int acc = 0, i = 0;
            char last = ' ', operation = '+';
    
            while (i < input.Length)
            {
                var current = input[i];
                switch (current)
                {
                    case ' ':
                        if (last != ' ')
                        {
                            switch (operation)
                            {
                                case '+': acc += last - '0'; break;
                                case '-': acc -= last - '0'; break;
                                case '*': acc *= last - '0'; break;
                                case '/': acc /= last - '0'; break;
                            }
    
                            last = ' ';
                            i++;
                        }
    
                        break;
    
                    case '0': case '1': case '2': case '3': case '4':
                    case '5': case '6': case '7': case '8': case '9':
                        if (last == ' ')
                        {
                            last = current;
                            i++;
                        }
                        else
                        {
                            var num = (last - '0') * 10 + (current - '0');
                            switch (operation)
                            {
                                case '+': acc += num; break;
                                case '-': acc -= num; break;
                                case '*': acc *= num; break;
                                case '/': acc /= num; break;
                            }
    
                            last = ' ';
                            i += 2;
                        }
                        break;
    
                    case '+': case '-': case '*': case '/':
                        operation = current;
                        i += 2;
                        break;
                }
            }
    
            if (last != ' ')
                switch (operation)
                {
                    case '+': acc += last - '0'; break;
                    case '-': acc -= last - '0'; break;
                    case '*': acc *= last - '0'; break;
                    case '/': acc /= last - '0'; break;
                }
    
            return acc;
        }
    

    I implemented one more variant:

        static int Calculate3(string input)
        {
            int acc = 0, i = 0;
            var operation = '+';
    
            while (true)
            {
                var a = input[i++] - '0';
                if (i == input.Length)
                {
                    switch (operation)
                    {
                        case '+': acc += a; break;
                        case '-': acc -= a; break;
                        case '*': acc *= a; break;
                        case '/': acc /= a; break;
                    }
    
                    break;
                }
    
                var b = input[i];
                if (b == ' ') i++;
                else
                {
                    a = a * 10 + (b - '0');
                    i += 2;
                }
    
                switch (operation)
                {
                    case '+': acc += a; break;
                    case '-': acc -= a; break;
                    case '*': acc *= a; break;
                    case '/': acc /= a; break;
                }
    
                if (i >= input.Length) break;
                operation = input[i];
                i += 2;
            }
    
            return acc;
        }
    

    Results in abstract points:

    • Calculate1 230
    • Calculate2 192
    • Calculate3 111
    0 讨论(0)
提交回复
热议问题