c# float [] average loses accuracy

后端 未结 3 1553
遇见更好的自我
遇见更好的自我 2021-01-21 08:01

I am trying to calculate average for an array of floats. I need to use indices because this is inside a binary search so the top and bottom will move. (Big picture we are trying

相关标签:
3条回答
  • 2021-01-21 08:37

    I'd rewrite this as:

    int count = (top - bottom) + 1;//number of elements in this iteration
    double sum = 0;
    for(int i = bottom; i <= top; i++)
    {
         sum += input[i];
    }
    float average = (float)(sum/count);
    

    That way you're using a high precision accumulator, which helps reduce rounding errors.

    btw. if performance isn't that important, you can still use LINQ to calculate the average of an array slice:

    input.Skip(bottom).Take(top - bottom + 1).Average()
    

    I'm not entirely sure if that fits your problem, but if you need to calculate the average of many subarrays, it can be useful to create a persistent sum array, so calculating an average simply becomes two table lookups and a division.

    0 讨论(0)
  • 2021-01-21 08:51

    Just to add to the conversation, be careful when using Floating point primitives.

    What Every Computer Scientist Should Know About Floating-Point Arithmetic

    Internally floating point numbers store additional least significant bits that are not reflected in the displayed value (aka: Guard Bits or Guard Digits). They are, however, utilized when performing mathematical operations and equality checks. One common result is that a variable containing 0f is not always zero. When accumulating floating point values this can also lead to precision errors.

    Use Decimal for your accumulator:

    1. Will not have rounding errors due to Guard Digits
    2. Is a 128bit data type (less likely to exceed Max Value in your accumulator).

    For more info: What is the difference between Decimal, Float and Double in C#?

    0 讨论(0)
  • 2021-01-21 08:55

    I'm getting 2 places less accuracy than the c# Average()

    No, you are only losing 1 significant digit. The float type can only store 7 significant digits, the rest are just random noise. Inevitably in a calculation like this, you can accumulate round-off error and thus lose precision. Getting the round-off errors to balance out requires luck.

    The only way to avoid it is to use a floating point type that has more precision to accumulate the result. Not an issue, you have double available. Which is why the Linq Average method looks like this:

       public static float Average(this IEnumerable<float> source) {
           if (source == null) throw Error.ArgumentNull("source");
           double sum = 0;         // <=== NOTE: double
           long count = 0;
           checked {
               foreach (float v in source) {
                   sum += v;
                   count++;
               }
           }
           if (count > 0) return (float)(sum / count);
           throw Error.NoElements();
       }
    

    Use double to reproduce the Linq result with a comparable number of significant digits in the result.

    0 讨论(0)
提交回复
热议问题