faster strlen?

后端 未结 8 2383
小鲜肉
小鲜肉 2021-02-20 15:49

Typical strlen() traverse from first character till it finds \\0. This requires you to traverse each and every character. In algorithm sense, its O(N).

相关标签:
8条回答
  • 2021-02-20 16:22

    Actually, glibc's implementation of strlen is an interesting example of the vectorization approach. It is peculiar in that it doesn't use vector instructions, but finds a way to use only ordinary instructions on 32 or 64 bits words from the buffer.

    0 讨论(0)
  • 2021-02-20 16:29

    Here I attached the asm code from glibc 2.29. I removed the snippet for ARM cpus. I tested it, it is really fast, beyond my expectation. It merely do alignment then 4 bytes comparison.

    ENTRY(strlen)
    bic     r1, r0, $3              @ addr of word containing first byte
    ldr     r2, [r1], $4            @ get the first word
    ands    r3, r0, $3              @ how many bytes are duff?
    rsb     r0, r3, $0              @ get - that number into counter.
    beq     Laligned                @ skip into main check routine if no more
    orr     r2, r2, $0x000000ff     @ set this byte to non-zero
    subs    r3, r3, $1              @ any more to do?
    orrgt   r2, r2, $0x0000ff00     @ if so, set this byte
    subs    r3, r3, $1              @ more?
    orrgt   r2, r2, $0x00ff0000     @ then set.
    Laligned:               @ here, we have a word in r2.  Does it
    tst     r2, $0x000000ff         @ contain any zeroes?
    tstne   r2, $0x0000ff00         @
    tstne   r2, $0x00ff0000         @
    tstne   r2, $0xff000000         @
    addne   r0, r0, $4              @ if not, the string is 4 bytes longer
    ldrne   r2, [r1], $4            @ and we continue to the next word
    bne     Laligned                @
    Llastword:              @ drop through to here once we find a
    tst     r2, $0x000000ff         @ word that has a zero byte in it
    addne   r0, r0, $1              @
    tstne   r2, $0x0000ff00         @ and add up to 3 bytes on to it
    addne   r0, r0, $1              @
    tstne   r2, $0x00ff0000         @ (if first three all non-zero, 4th
    addne   r0, r0, $1              @  must be zero)
    DO_RET(lr)
    

    END(strlen)

    0 讨论(0)
提交回复
热议问题