What is the BigO of Swift's String.count?

前端 未结 3 1179
余生分开走
余生分开走 2021-01-03 05:08

When swift uses String.count is it:

O(n) where each time we call it we iterate through the entire String in order to count it

相关标签:
3条回答
  • 2021-01-03 05:19

    After failing to find documentation on this or being able to find this function within the source code I tested this myself using performance tests as described below. It assumed O(1) was possible based on PHP's Array being O(1). Swifts String.count function appears to be O(n).

    Results

    Is count cached when it's been called before now? (no)

    I also tested to see if calling String.count once would cache it. By comparing results when count has already been called and when it has been stored to a variable to ensure it's not being stored prior to calling .count in our normal tests.

    Tests

    import XCTest
    
    class CountTests: XCTestCase {
    
        func test100K() {
            let testString = String(repeating: "a", count: 100000)
            self.measure {
                _ = testString.count
            }
        }
    
        func test1000K() {
            let testString = String(repeating: "a", count: 1000000)
            self.measure {
                _ = testString.count
            }
        }
    
        func test10000K() {
            let testString = String(repeating: "a", count: 10000000)
            self.measure {
                _ = testString.count
            }
        }
    
        func test10000KCached() {
            let testString = String(repeating: "a", count: 10000000)
            _ = testString.count
            self.measure {
                _ = testString.count
            }
        }
    
        func test10000KStrong() {
            let testString = String(repeating: "a", count: 10000000)
            let count = testString.count
            self.measure {
                _ = count
            }
        }
    }
    
    0 讨论(0)
  • 2021-01-03 05:32

    Looks like O(n) to me based on a quick Playground test.

    for step in 1...10 {
        let length = step * 100000
        let string = String(repeating: "x", count: length)
        let start = Date()
        let stringLength = string.count
        let end = Date()
        print("Length: \(stringLength), time: \(end.timeIntervalSince(start))")
    }
    
    // Length: 100000, time: 0.00178205966949463
    // Length: 200000, time: 0.00132298469543457
    // Length: 300000, time: 0.00184988975524902
    // Length: 400000, time: 0.00218689441680908
    // Length: 500000, time: 0.00302803516387939
    // Length: 600000, time: 0.00368499755859375
    // Length: 700000, time: 0.0039069652557373
    // Length: 800000, time: 0.00444602966308594
    // Length: 900000, time: 0.0052180290222168
    // Length: 1000000, time: 0.00539696216583252
    
    0 讨论(0)
  • 2021-01-03 05:40

    It is definitely O(n). From the Swift Book:

    As a result, the number of characters in a string can't be calculated without iterating through the string to determine its extended grapheme cluster boundaries. If you are working with particularly long string values, be aware that the count property must iterate over the Unicode scalars in the entire string in order to determine the characters for that string.

    This has a few implications, the biggest of which is integer subscripting (i.e. str[5]) is not available through the standard library. Internally, String uses ASCII or UTF-16 encoding (from Swift 5, it uses UTF-8 only). If the string only uses ASCII characters then count can be O(1) but ASCII only has 127 characters so consider this an exception rather than the rule.

    NSString, on the other hand, always uses UTF-16 so accessing its length is O(1). And also keep in mind that NSString.length != String.count (try strings with emojis and you'll see).

    As for your second question, it does not cache count for subsequent calls. Every call to count is thus O(n), even if the string has not changed. The code in the Foundation repo also confirms that.

    0 讨论(0)
提交回复
热议问题