What is the difference between pandas.qcut and pandas.cut?

前端 未结 4 1316
轻奢々
轻奢々 2020-11-29 16:02

The documentation says:

http://pandas.pydata.org/pandas-docs/dev/basics.html

\"Continuous values can be discretized using the cut (bins based on values) an

4条回答
  •  有刺的猬
    2020-11-29 16:33

    • cut command creates equispaced bins but frequency of samples is unequal in each bin
    • qcut command creates unequal size bins but frequency of samples is equal in each bin.

        >>> x=np.array([24,  7,  2, 25, 22, 29])
        >>> x
        array([24,  7,  2, 25, 22, 29])
    
        >>> pd.cut(x,3).value_counts() #Bins size has equal interval of 9
        (2, 11.0]        2
        (11.0, 20.0]     0
        (20.0, 29.0]     4
    
        >>> pd.qcut(x,3).value_counts() #Equal frequecy of 2 in each bins
        (1.999, 17.0]     2
        (17.0, 24.333]    2
        (24.333, 29.0]    2
    

提交回复
热议问题