Are JavaScript arrays sparse?

情书的邮戳 2020-11-22 08:12

That is, if I use the current time as an index into the array:

array[new Date().getTime()] = value;

will the interpreter instantiate all the elements from 0 up to that index?

7 answers
  • 2020-11-22 08:39

    Sparseness (or denseness) can be confirmed empirically for Node.js with the non-standard process.memoryUsage().
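
    The measurement used in the REPL sessions below can be wrapped in a small helper. This is only a sketch: heapUsedMB and measure are hypothetical names, not part of Node.js, and heapUsed readings are noisy because the garbage collector may run between two readings.

    ```javascript
    // Hypothetical helper: heap currently in use, in MB, rounded to 2 decimals.
    function heapUsedMB() {
      return Math.round((process.memoryUsage().heapUsed / 1024 / 1024) * 100) / 100;
    }

    // Hypothetical helper: build an array and report the heap delta.
    // The result stays referenced until after the second reading.
    function measure(label, build) {
      const before = heapUsedMB();
      const result = build();
      const after = heapUsedMB();
      const delta = (after - before).toFixed(2);
      console.log(`${label}: length=${result.length}, heap delta ~${delta} MB`);
      return result;
    }

    // One element at a high index, as in the first REPL session.
    const holey = measure('single high index', () => {
      const a = [];
      a[2 ** 24] = 2 ** 24;
      return a;
    });
    ```

    A delta near zero suggests the engine kept the array sparse; a delta in the tens of megabytes suggests it allocated a dense backing store.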

    Sometimes node is clever enough to keep the array sparse:

    Welcome to Node.js v12.15.0.
    Type ".help" for more information.
    > console.log(`The script is using approximately ${Math.round(process.memoryUsage().heapUsed / 1024 / 1024 * 100) / 100} MB`)
    The script is using approximately 3.07 MB
    undefined
    > array = []
    []
    > array[2**24] = 2**24
    16777216
    > array
    [ <16777216 empty items>, 16777216 ]
    > console.log(`The script is using approximately ${Math.round(process.memoryUsage().heapUsed / 1024 / 1024 * 100) / 100} MB`)
    The script is using approximately 2.8 MB
    undefined
    

    Sometimes node chooses to make it dense (this behavior might well be optimized in future):

    > otherArray = Array(2**24)
    [ <16777216 empty items> ]
    > console.log(`The script is using approximately ${Math.round(process.memoryUsage().heapUsed / 1024 / 1024 * 100) / 100} MB`)
    The script is using approximately 130.57 MB
    undefined
    

    Then sparse again:

    > yetAnotherArray = Array(2**32-1)
    [ <4294967295 empty items> ]
    > console.log(`The script is using approximately ${Math.round(process.memoryUsage().heapUsed / 1024 / 1024 * 100) / 100} MB`)
    The script is using approximately 130.68 MB
    undefined
    

    So to get a feel for the original AIX kernel bug, a dense array may need to be forced with a range-like construct:

    > denseArray = [...Array(2**24).keys()]
    [
       0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11,
      12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23,
      24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35,
      36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47,
      48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59,
      60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71,
      72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83,
      84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95,
      96, 97, 98, 99,
      ... 16777116 more items
    ]
    > console.log(`The script is using approximately ${Math.round(process.memoryUsage().heapUsed / 1024 / 1024 * 100) / 100} MB`);
    The script is using approximately 819.94 MB
    undefined
    

    Because why not make it fall over?

    > tooDenseArray = [...Array(2**32-1).keys()]
    
    <--- Last few GCs --->
    
    [60109:0x1028ca000]   171407 ms: Scavenge 1072.7 (1090.0) -> 1056.7 (1090.0) MB, 0.2 / 0.0 ms  (average mu = 0.968, current mu = 0.832) allocation failure 
    [60109:0x1028ca000]   171420 ms: Scavenge 1072.7 (1090.0) -> 1056.7 (1090.0) MB, 0.2 / 0.0 ms  (average mu = 0.968, current mu = 0.832) allocation failure 
    [60109:0x1028ca000]   171434 ms: Scavenge 1072.7 (1090.0) -> 1056.7 (1090.0) MB, 0.2 / 0.0 ms  (average mu = 0.968, current mu = 0.832) allocation failure 
    
    
    <--- JS stacktrace --->
    
    ==== JS stack trace =========================================
    
        0: ExitFrame [pc: 0x100931399]
        1: StubFrame [pc: 0x1008ee227]
        2: StubFrame [pc: 0x100996051]
    Security context: 0x1043830808a1 <JSObject>
        3: /* anonymous */ [0x1043830b6919] [repl:1] [bytecode=0x1043830b6841 offset=28](this=0x104306fc2261 <JSGlobal Object>)
        4: InternalFrame [pc: 0x1008aefdd]
        5: EntryFrame [pc: 0x1008aedb8]
        6: builtin exit frame: runInThisContext(this=0x104387b8cac1 <ContextifyScript map = 0x1043...
    
    FATAL ERROR: invalid array length Allocation failed - JavaScript heap out of memory
    
    Writing Node.js report to file: report.20200220.220620.60109.0.001.json
    Node.js report completed
     1: 0x10007f4b9 node::Abort() [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     2: 0x10007f63d node::OnFatalError(char const*, char const*) [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     3: 0x100176a27 v8::Utils::ReportOOMFailure(v8::internal::Isolate*, char const*, bool) [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     4: 0x1001769c3 v8::internal::V8::FatalProcessOutOfMemory(v8::internal::Isolate*, char const*, bool) [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     5: 0x1002fab75 v8::internal::Heap::FatalProcessOutOfMemory(char const*) [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     6: 0x1005f3e9b v8::internal::Runtime_FatalProcessOutOfMemoryInvalidArrayLength(int, unsigned long*, v8::internal::Isolate*) [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     7: 0x100931399 Builtins_CEntry_Return1_DontSaveFPRegs_ArgvOnStack_NoBuiltinExit [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
     8: 0x1008ee227 Builtins_IterableToList [/Users/pzrq/.nvm/versions/node/v12.15.0/bin/node]
    Abort trap: 6
    
  • 2020-11-22 08:43

    Yes, they are. Semantically they behave like hash tables (engines are free to use a denser representation internally), so you can use not only large integers but also strings, floats, or other objects as keys. All keys except Symbols get converted to strings before being added to the hash. You can confirm this with some test code:

    <script>
      var array = [];
      array[0] = "zero";
      array[new Date().getTime()] = "now";
      array[3.14] = "pi";
    
      for (var i in array) {
          alert("array["+i+"] = " + array[i] + ", typeof("+i+") == " + typeof(i));
      }
    </script>
    

    Displays:

    array[0] = zero, typeof(0) == string
    array[1254503972355] = now, typeof(1254503972355) == string
    array[3.14] = pi, typeof(3.14) == string
    

    Notice how I used for...in syntax, which only gives you the indices that are actually defined. If you use the more common for (var i = 0; i < array.length; ++i) style of iteration then you will obviously have problems with non-standard array indices.
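
    A minimal sketch of the same check without alert(), assuming Node.js or a modern browser console. The timestamp is the literal from the output above. One detail worth noting: only integers from 0 to 2^32 - 2 count as array indices, so a millisecond timestamp is stored as an ordinary string-keyed property and does not change length.

    ```javascript
    const array = [];
    array[0] = 'zero';
    array[1254503972355] = 'now'; // above 2**32 - 2, so not an array index
    array[3.14] = 'pi';           // not an integer, so not an array index

    // Every own key is a string, whatever type was used to set it.
    const keys = Object.keys(array);
    console.log(keys);         // [ '0', '1254503972355', '3.14' ]
    console.log(array.length); // 1

    // An index-based loop therefore only ever sees array[0].
    const seen = [];
    for (let i = 0; i < array.length; ++i) {
      seen.push(array[i]);
    }
    console.log(seen);         // [ 'zero' ]
    ```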

  • 2020-11-22 08:43

    JavaScript objects are sparse, and arrays are just specialized objects with an auto-maintained length property (which is actually one larger than the largest index, not the number of defined elements) and some additional methods. You are safe either way; use an array if you need its extra features, and an object otherwise.
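
    The difference between length and the number of defined elements can be seen in a short sketch (plain JavaScript, no assumptions beyond a standard engine):

    ```javascript
    const a = [];
    a[9] = 'last';

    const defined = Object.keys(a).length; // counts only elements that exist

    console.log(a.length); // 10, one larger than the largest index
    console.log(defined);  // 1
    console.log(3 in a);   // false, index 3 was never created
    ```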

  • 2020-11-22 08:50

    How exactly JavaScript arrays are implemented differs from browser to browser, but they generally fall back to a sparse implementation - most likely the same one used for property access of regular objects - if using an actual array would be inefficient.

    You'll have to ask someone with more knowledge about specific implementations to answer what exactly triggers the shift from dense to sparse, but your example should be perfectly safe. If you want to get a dense array, you should call the constructor with an explicit length argument and hope you'll actually get one.

    See this answer for a more detailed description by olliej.
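
    How an engine lays an array out in memory cannot be observed from script, but whether an array has missing elements can. A sketch, where hasHoles is a hypothetical helper name: the constructor with a length argument yields an array with no elements at all, while fill() or Array.from() produce one where every index exists.

    ```javascript
    // Hypothetical helper: true if any index below length has no element.
    function hasHoles(arr) {
      for (let i = 0; i < arr.length; i++) {
        if (!(i in arr)) return true;
      }
      return false;
    }

    const holey = new Array(5);                           // length 5, no elements yet
    const filled = new Array(5).fill(0);                  // five stored zeros
    const range = Array.from({ length: 5 }, (_, i) => i); // 0, 1, 2, 3, 4

    console.log(hasHoles(holey));  // true
    console.log(hasHoles(filled)); // false
    console.log(hasHoles(range));  // false
    ```

    Having every index populated makes a dense representation more likely, but that choice remains up to the engine.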

  • 2020-11-22 08:52

    You could avoid the issue by using a JavaScript syntax designed for this sort of thing. You can treat an object as a dictionary, and the "for ... in ... " syntax will still let you grab all its keys.

    var sparse = {}; // not []
    sparse["whatever"] = "something";
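
    Applied to the question's case, this might look like the sketch below. The timestamp is a fixed literal standing in for new Date().getTime(), and the hasOwnProperty guard is a common precaution against inherited keys, not something the approach strictly requires.

    ```javascript
    const sparse = {}; // plain object used as a dictionary
    sparse[1254503972355] = 'now'; // stand-in for new Date().getTime()
    sparse['whatever'] = 'something';

    const entries = [];
    for (const key in sparse) {
      // Skip anything inherited through the prototype chain.
      if (Object.prototype.hasOwnProperty.call(sparse, key)) {
        entries.push(`${key} = ${sparse[key]}`);
      }
    }
    console.log(entries); // [ '1254503972355 = now', 'whatever = something' ]
    ```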
    
  • 2020-11-22 08:58

    The answer, as is usually true with JavaScript, is "it's a bit weirder...."

    Memory usage is not defined and any implementation is allowed to be stupid. In theory, const a = []; a[1000000]=0; could burn megabytes of memory, as could const a = [];. In practice, even Microsoft avoids those implementations.

    As Justin Love points out, the length property is one more than the highest index set. BUT it's only updated if the key is a non-negative integer below 2^32 - 1.

    So, the array is sparse. BUT spread syntax (as in Math.max(...a)) and "for ... of" will walk through the entire range of integer indices from 0 to length - 1, visiting many that return 'undefined'. Callback methods such as reduce() and forEach() skip the missing indices. And 'for ... in' loops might do as you expect, visiting only the defined keys.

    Here's an example using Node.js:

    "use strict";
    const print = console.log;
    
    let a = [0, 10];
    // a[2] and a[3] skipped
    a[4] = 40;
    a[5] = undefined;  // which counts towards setting the length
    a[31.4] = 'ten pi';  // doesn't count towards setting the length
    a['pi'] = 3.14;
    print(`a.length= :${a.length}:, a = :${a}:`);
    print(`Math.max(...a) = :${Math.max(...a)}: because of 'undefined values'`);
    for (let v of a) print(`v of a; v=:${v}:`);
    for (let i in a) print(`i in a; i=:${i}: a[i]=${a[i]}`);
    

    giving:

    a.length= :6:, a = :0,10,,,40,:
    Math.max(...a) = :NaN: because of 'undefined values'
    v of a; v=:0:
    v of a; v=:10:
    v of a; v=:undefined:
    v of a; v=:undefined:
    v of a; v=:40:
    v of a; v=:undefined:
    i in a; i=:0: a[i]=0
    i in a; i=:1: a[i]=10
    i in a; i=:4: a[i]=40
    i in a; i=:5: a[i]=undefined
    i in a; i=:31.4: a[i]=ten pi
    i in a; i=:pi: a[i]=3.14
    

    But there are more corner cases with arrays not yet mentioned.
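
    One such corner case, sketched with the same array as above: a missing index and a stored undefined look identical when read, but iteration treats them differently. "for ... of" visits every index below length, while forEach() and reduce() only visit indices that exist.

    ```javascript
    const a = [0, 10];
    a[4] = 40;        // indices 2 and 3 are never created
    a[5] = undefined; // a stored value, not a missing index

    const viaForOf = [];
    for (const v of a) viaForOf.push(v);

    const viaForEach = [];
    a.forEach((v) => viaForEach.push(v));

    // Count how many times the reduce callback is invoked.
    const reduceVisits = a.reduce((count) => count + 1, 0);

    console.log(viaForOf.length);   // 6
    console.log(viaForEach.length); // 4
    console.log(reduceVisits);      // 4
    console.log(2 in a, 5 in a);    // false true
    ```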
