Optimizing strings in Cython

后端 未结 1 1863
挽巷
挽巷 2021-02-14 12:31

I\'m trying to demonstrate to our group the virtues of Cython for enhancing Python performance. I have shown several benchmarks, all that attain speed up by just:

1条回答
  •  清歌不尽
    2021-02-14 13:19

    I suggest you do your operations on cpython.array.arrays. The best documentation is the C API and the Cython source (see here).

    from cpython cimport array
    
    def cfuncA():
        cdef str a
        cdef int i,j
        for j in range(1000):
            a = ''.join([chr(i) for i in range(127)])
    
    def cfuncB():
        cdef:
            str a
            array.array[char] arr, template = array.array('c')
            int i, j
    
        for j in range(1000):
            arr = array.clone(template, 127, False)
    
            for i in range(127):
                arr[i] = i
    
            a = arr.tostring()
    

    Note that the operations required vary very much on what you do to your strings.

    >>> python2 -m timeit -s "import pyximport; pyximport.install(); import cyytn" "cyytn.cfuncA()"
    100 loops, best of 3: 14.3 msec per loop
    
    >>> python2 -m timeit -s "import pyximport; pyximport.install(); import cyytn" "cyytn.cfuncB()"
    1000 loops, best of 3: 512 usec per loop
    

    So that's a 30x speed-up in this case.


    Also, FWIW, you can take off another fair few µs by replacing arr.tostring() with arr.data.as_chars[:len(arr)] and typing a as bytes.

    0 讨论(0)
提交回复
热议问题