Split value in 24 randomly sized parts using C#

前端 未结 11 1092
忘了有多久
忘了有多久 2021-02-03 12:01

I have a value, say 20010. I want to randomly divide this value over 24 hours. So basically split the value into a 24 slot big array where all slots are randomly big.

Wh

相关标签:
11条回答
  • 2021-02-03 12:35

    I've calculated the average size of each of the 24 buckets over 100 trials for each of the algorithms proposed here. I thought it was interesting that three out of the four do seem to result in 20010/24 items per bucket on average, but the naive method I described converges to that average most quickly. This makes some intuitive sense to me. That method is something like snowing randomly on 24 buckets, and thus likely to result in buckets which are approximately equal in size. The others are more like hacking randomly at a length of wood.

    Bevan:  [751, 845, 809, 750, 887, 886, 838, 868, 837, 902, 841, 812, 818, 774, 815, 857, 752, 815, 896, 872, 833, 864, 769, 894]
    Gregory:  [9633, 5096, 2623, 1341, 766, 243, 159, 65, 21, 19, 16, 4, 3, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2]
    mjv:  [895, 632, 884, 837, 799, 722, 853, 749, 915, 756, 814, 863, 842, 642, 820, 805, 659, 862, 742, 812, 768, 816, 721, 940]
    peterallenwebb:  [832, 833, 835, 829, 833, 832, 837, 835, 833, 827, 833, 832, 834, 833, 836, 833, 838, 834, 834, 833, 834, 832, 836, 830]
    

    And here is the python code: import random

    N = 20010;
    
    def mjv():
        gaps = [ random.randrange(0, N) for i in range(0, 24) ]
        gaps = gaps + [0, N]
        gaps.sort()
        value = [ gaps[i+1] - gaps[i] for i in range(0, 24) ]
        return value
    
    def gregory():
        values = []
        remainingPortion = N
    
        for i in range(0, 23):
            val = random.randrange(1, remainingPortion - (23 - i))
            remainingPortion = remainingPortion - val
            values.append(val)
    
        values.append(remainingPortion)
    
        return values
    
    
    def peterallenwebb():
        values = [0  for i in range(0, 24) ]
    
        for i in range(0, N):
            k = random.randrange(0, 24)
            values[k] = values[k] + 1 
    
        return values
    
    def bevan():
        values = [];
        sum = 0.0
    
        for i in range(0, 24):
            k = random.random()
            sum = sum + k
            values.append(k);
    
        scaleFactor = N / sum
    
        for j in range(0, 24):
            values[j] = int(values[j] * scaleFactor)
    
        return values
    
    
    def averageBucketSizes(method):
        totals = [0 for i in range(0, 24)]
        trials = 100
    
        for i in range(0,trials):
            values = method()
    
            for j in range(0, 24):
                totals[j] = totals[j] + values[j]      
    
        for j in range(0, 24):
            totals[j] = totals[j] / trials
    
        return totals;
    
    
    print 'Bevan: ', averageBucketSizes(bevan)
    print 'Gregory: ', averageBucketSizes(gregory)
    print 'mjv: ', averageBucketSizes(mjv)
    print 'peterallenwebb: ', averageBucketSizes(peterallenwebb)
    

    Let me know if you see any mistakes. I will re-run.

    0 讨论(0)
  • 2021-02-03 12:35

    If you want to be sure that you're not biasing the process without much analysis, you could just create a 24 element array, initialize each element to 0 and then add 1 to one of the elements at random 20010 times.

    It all depends on the kind of distributions you want to see, but I don't think any of the other techniques recommend so far will result in the hour-long "buckets" being statistically indistinguishable.

    0 讨论(0)
  • 2021-02-03 12:40

    This will give you a somewhat "decreasing" randomness the higher the index becomes. You can randomise the list positions if required? It depends on what you need to do with it.

    int initialValue = 20010;
    var values = new List<int>();
    
    Random rnd = new Random();
    int currentRemainder = initialValue;
    
    for (int i = 0; i < 21; i++)
    {
        //get a new value;
        int val = rnd.Next(1, currentRemainder - (21 - i));
    
        currentRemainder -= val;
        values.Add(val);
    }
    
    values.Add(currentRemainder);
    
    //initialValue == values.Sum()
    
    0 讨论(0)
  • 2021-02-03 12:43

    Draw 23 (not 24) numbers at random, (without duplicates), in the range 1 to 20009. Add 0 and 20010 the list and order these numbers, the difference between each two consecutive number gives you one slot value.

    An online approach is also possible by drawing one value at a time and subtracting it from the "pot", drawing anew when the number is bigger than the amount left. This approach however may lead to a greater deviation of the size of the slots.

    0 讨论(0)
  • 2021-02-03 12:43

    Here's a functional solution using mjv's algorithm:

    static int[] GetSlots(int slots, int max)
    {
        return new Random().Values(1, max)
                           .Take(slots - 1)
                           .Append(0, max)
                           .OrderBy(i => i)
                           .Pairwise((x, y) => y - x)
                           .ToArray();
    }
    
    public static IEnumerable<int> Values(this Random random, int minValue, int maxValue)
    {
        while (true)
            yield return random.Next(minValue, maxValue);
    }
    
    public static IEnumerable<TResult> Pairwise<TSource, TResult>(this IEnumerable<TSource> source, Func<TSource, TSource, TResult> resultSelector)
    {
        TSource previous = default(TSource);
    
        using (var it = source.GetEnumerator())
        {
            if (it.MoveNext())
                previous = it.Current;
    
            while (it.MoveNext())
                yield return resultSelector(previous, previous = it.Current);
        }
    }
    
    public static IEnumerable<T> Append<T>(this IEnumerable<T> source, params T[] args)
    {
        return source.Concat(args);
    }
    
    0 讨论(0)
提交回复
热议问题